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DeacriptioT^ 

5 PRODUCTION OP PIBRIN06EN IN TRANSGENIC ANMALS 

Backcrround of th^ y nvention 

The final step in the blood coagulation cascade 
is the thrombin-catalyzed conversion of the soltible plasma 

IC protein fibrinogen to insoluble fibrin. Thrombin cleaves 
a small peptide (fibrinopeptide A) from one of the three 
component chains (the Aa>chain) of fibrinogen. Fibrin 
monomers subsequently polymerize and are cross-linked by 
activated factor XIII to form a stable clot. 

Pibrinogen is a key coiq>onent of biological 
tissue glues (see, e.g., U.S. Patents Nos. 4,377,572 and 
4,442,655), which mimic the formation of natural blood 
clots to promote hemostasis and repair damaged tissue. 
Tissue glues provide an adjuct or alternative to sutures, 

20 staples and other mechanical means for wound closure- 
However, the principal ingredients of these products 
(fibrinogen, factor XIII and thrombin) are prepared from 
pooled human plasma by cryoprecipitation (e.g. U.S. 
Patents No. 4,377,572; 4,362,567; 4,909,251) or ethanol 

25 precipitation (e.g. U.S. Patent No. 4,442,655) or from 
single donor plasma (e.g. U.S. Patent No. 4,627,879; 

Spotnitz et al., Sura. §5: 166-168, 1989). The 

resultant f ibrinogen/factor XIII preparation is mixed with 
bovine thrombin immediately before use to convert the 

30 fibrinogen to fibrin and activate the factor XIII, thus 
initiating coagulation of the adhesive. 

Commercially available adhesives are of pooled 
plasma origin. Because blood-derived products have been 
associated with the transmission of human immtinodef iciency 

35 virus (HIV), hepatitis virus and other etiologic agents, 
the acceptance and availability of such adhesives is 
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limited. At present they are not approved for use in the 
United States, 

While the ubb of autologous plasma reduces the 
risk of disease transmission, autologous adhesives can 

5 only be used in elective surgery when the patient is able 
to donate the necessary blood in advemce. 

As noted above, fibrinogen consists of three 
polypeptide chains, each of which is present in two copies 
in the assembled molecule. These chains, designated the 

0 Aa» and 7-chains, are coordinately expressed, assembled 
and secreted by the liver. While it might be expected 
that recombinant DNA technology could provide an 
alternative to the isolation of fibrinogen from plasma, 
this goal has proven to be elusive. The three fibrinogen 

5 chains have been individually expressed in E. coli. (Lord, 
EHA A: 33-38, 1985; Bolyard and Lord, Gene 66 ; 183-192, 
1988; Bolyard and Lord, Blood 73: 1202-1206) , but 
functional fibrinogen has not been produced in a 
prokeuryotic system. Expression of biologically competent 

0 fibrinogen in yeast has not been reported . Cultvired 
transfected mammalian cells have been used to express 
biologically active fibrinogen (Parrell et al.. Blood 74; 
55a, 1989; Hartwig and Danishefsky, J. Biol. Chem. 266; 
6578-6585, 1991; Parrell et al.. Biochemistry Ifi: 9414- 

5 9420, 1991), but expression levels have been so low that 
production of recombinant fibrinogen in commercial 
quantities is not feasible. Experimental evidence 
suggests that lower transcription rates in cultured cells 
as compeared to liver may be a factor in the low expression 

0 rates achieved to date, but increasing the amount of 
fibrinogen chain mRNA in transfected BHK cells did not 
produce corresponding increases in fibrinogen protein 
secretion (Prunkard and Poster, XIV Congress of the 
International Society on Thrombosis and Haemostasis, 

5 1993). These latter results suggest that proper assembly 
and processing of fibrinogen involves tissue-specific 
mechanisms not present in coimnon laboratory cell lines. 
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There remains a need in the art for methods of 
producing large quantities of high quality fibrinogen for 
use in tissue adhesives and other applications. There is 
a further need for fibrinogen that is free of blood-borne 
5 pathogens. The present invention fulfills these needs and 
provides other, related advantages. 

Snunnar-Y »f Invention 

It is em object of the present invention to 

10 provide commercially useful quantities of recombinant 
fibrinogen, particularly recombinant human fibrinogen. It 
is a further object of the invention to provide materials 
and methods for expressing fibrinogen in the mammary 
tissue of transgenic animals, particularly livestock 

15 animals such as cattle, sheep, pigs and goats. 

Within one aspect, the present invention 
provides a method for producing fibrinogen comprising (aj 
providing a first DNA segment encoding a secretion signal 
operably linked to a fibrinogen Aa chain, a second DNA 

20 segment encoding a secretion signal operea>ly linked to a 
fibrinogen chain, and a third DNA segment encoding a 
secretion signal operably linked to a fibrinogen y chain, 
wherein each of the first, second and third segments is 
operably linked to additional DNA segments required for 

25 its expression in the mammary gland of a host female 
mammal; (b) introducing the DNA segments into a fertilized 
egg of a non-human mammaliam species; (c) inserting the 
egg into an oviduct or uterus of a female of the species 
to obtain offspring carrying the DNA constructs; (d) 

30 breeding the offspring to produce female progeny that 
express the first, second and third DNA segments and 
produce milk containing biocompetent fibrinogen encoded by 
the segments; (e) collecting milk from the female progeny; 
and (f) recovering the fibrinogen from the milk. Within 

35 one embodiment, the egg containing the introduced segments 
is cultured for a period of time prior to insertion. 
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Within another aspect, the invention provides a 
method of producing fibrinogen comprising the steps of (a) 
incorporating a first DNA segment encoding a secretion 
signal operably linked to an Aa chain of fibrinogen into a 
5 ^-lactoglobulin gene to produce a first gene fusion; (b) 
incorporating a second DNA segment encoding a secretion 
signal operably linked to a chain of fibrinogen into a 
iS-lactoglobulin gene to produce a second gene fusion; (c) 
incorporating a third DNA segment encoding a secration 

10 signal operably linked to a 7 chain of fibrinogen into a ^- 
lactoglobulin gene to produce a third gene fusion; (d) 
introducing the first, second and third gene fusions into 
the germ line of a non-hiraan mamiaal so that the DNA 
segments are expressed in a mammary glsmd of the mammal or 

15 its female progeny amd biocompetent fibrinogen is secreted 
into milk of the mammal or its female progeny; (e) 
obtaining milk from the xaammal or its female progeny; and 
(f) recovering the fibrinogen from the milk. Within 
prefeirred embodiments, the mammal is a sheep, pig, goat or 

20 bovine. 

Within another aspect, the invention provides a 
method for producing fibrinogen comprising the steps of 
(a) providing a transgenic female non-human mammal 
carrying in its germline heterologous DNA segments 

25 encoding Aa, and 7 chains of fibrinogen, %4ierein the DNA 
segments are expressed in a mammary gland of the mammal 
and fibrinogen encoded by the DNA segments is secreted 
into milk of the mammal; (b) collecting milk from the 
mammal; and (c) recovering the fibrinogen from the milk. 

30 Within another aspect, the invention provides a, 

non-humcin mammalian embryo containing in its nucleus 
heterologous DNA segments encoding Aa, B^ and 7 chains of 
fibrinogen. Within a related aspect, the invention 
provides a transgenic non-human female mammal that 

35 produces recoveredble amounts of hximan fibrinogen in its 
milk. 
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Within another aspect, the invention provides a 
method for producing a transgenic offspring of a meunmal 
comprising the steps of (a) providing a first DNA segjient 
encoding a fibrinogen Aft chain, a second DNA segment 
5 encoding a fibrinogen chain, and a third DNA segment 
encoding a fibrinogen 7 chain, wherein each of said first, 
second and third segments is operably linked to additional 
DNA segments required for its expression in a mammary 
gland of a host female mammal and secretion into milk of 

10 the host female mammal; (b) introducing the DNA segments 
into a fertilized egg of a mammal of a non-human species ; 
(c) inserting the egg into an oviduct or uterus of a 
female of the non-human species to obtain an offspring 
carrying the first, second and third DNA segments. In a 

15 related aspect, the invention provides non-human mammals 
produced according to this process. 

Within an additional aspect, the invention 
provides a non-human mammal carrying its germline DNA 
segments encoding heterologous Aa, and 7 chains of 

20 fibrinogen, wherein female progeny of the mammal express 
the DNA segments in a mammary gland to produce 
biocompetent fibrinogen. 

These and other aspects of the invention will 
become evident to the skilled practitioner upon reference 

25 to the following detailed description and the attached 
drawings. 
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Figure 1 illustrates the subcloning of a hxman 
fibrinogen Aft chain DNA sequence. 

Figure 2 is a partial restriction map of the 
5 vector Zem228. Syiobols used are MT-lp, mouse 

metallothionein promoter; SV40t, SV40 terminator; and 
SV40p, SV40 promoter. 

Figure 3 illustrates the subcloning of a human 
fibrinogen chain DNA sequence. 
10 Figure 4 illustrates the subcloning of a human 

fibrinogen 7 chain DNA sequence. 

Figure 5 is a partial restriction map of the 
vector Zem219b. Symbols used are MT-lp, mouse 

metallothionein promoter; hGHt, human growth hormone 
15 terminator ; SV4 Op , SV4 0 promoter ; DHFR , dihydrof olate 
reductase gene; and SV40t, SV40 terminator. 

Detailed Description of the Invention 

Prior to setting forth the invention in detail, 
20 it will be helpful to define certain terms used herein: 

As used herein, the term ""biocompetent 
fibrinogen** is used to denote fibrinogen that polymerizes 
when treated with thrombin to form insoluble fibrin. 

The term **egg** is used to denote an imfertilized 
25 ovum, a fertilized ovum prior to fusion of the pronuclei 
or an early stage embryo (fertilized ovum with fused 
pronuclei) . 

A ** f ema le laamma 1 that produces mi Ik conta ining 
biocompetent fibrinogen** is one that, following pregnancy 
30 and delivery, produces, during the lactation period, milk 
containing recover2UE>le amounts of biocompetent fibrinogen. 
Those skilled in the art will recognized that such animals 
will produce milk, and therefore the fibrinogen, 
discontinuously . 

35 The term ••progeny is used in its usual sense to 

include children and descendemts . 



wo 95/2^68 PCr/DS9S/02648 



The term ••heterologous" is used to denote 
genetic xaaterial originating from a different species than 
that into which it has been introduced, or a protein 
produced from such genetic material. 
5 Within the present invention, transgenic animal 

technology is employed to produce fibrinogen within the 
mammary glands of a host female mammal. Expression in the 
mammary gland and subsequent secretion of the protein of 
interest into the milk overcomes many difficulties 

10 encountered in isolating proteins from other sources. 
Milk is readily collected, available in large quantities, 
and well characterized biochemically. Furthermore, the 
major milk proteins are present in milk at high 
concentrations (from about 1 to 15 g/1) . 

15 Prom a commercial point of view, it is clearly 

preferable to use as the host a species that has a large 
milk yield. While smaller animals such as mice and rats 
can be used (and are preferred at the proof -of -concept 
stage) , within the present invention it is preferred to 

20 use livestock mammals including, but not limited to, pigs, 
goats, sheep and cattle. Sheep are particularly preferred 
due to such factors as the previous history of 
transgenesis in this species, milk yield, cost and the 
ready availability of equipment for collecting sheep milk. 

25 See WO 88/00239 for a con^arison of factors influencing 
the choice of host species. It is generally desirable to 
select a breed of host emimal that has been bred for dairy 
use, such as East Friesland sheep, or to introduce dairy 
stock by breeding of the transgenic line at a later date. 

30 In any event, animals of known, good health status should 
be used. 

Fibrinogen produced according to the present 
invention may be human fibrinogen or fibrinogen of a non- 
human animal. For medical uses, it is preferred to employ 
35 proteins native to the patient. The present invention 
thus provides fibrinogen for use in both human and 
veterinary medicine. Cloned DNA molecules encoding the 
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coiqponent chains of human fibrinogen are disclosed by 
Rixon et al, f Biochcm> 22: 3237, 1983), Chung et al. 
(Bioch^^ 22: 3244, 1983), Chung et al. ( Biochem. 22.: 
3250, 1983), Chung et al. f Adv> Exd> Med. Biol, 281 : 39- 
5 48, 1990) and Chung et al. f Ann. NY Acad. Sci, 4fi8: 449- 
456, 1983). Bovine fibrinogen clones are disclosed by 
Brown et al. fNuc. Acids Res. 12: 6397, 1989) and Chung et 
al. fProc. Natl. Acad. Sci. USA 28: 1466-1470, 1981). 
Other mainmalian fibrinogen clones are disclosed by 

10 Murakawa et al. f Thromb. Haemost. 69 : 351-360, 1993). 
Representative sequences of human Aa, B^ and 7 chain genes 
are sho%m in SEQ ID NOS: 1, 3 and 5, respectively. Those 
skilled in the art will recognize that allelic variants of 
these sequences will exist; that additional variants can 

15 be generated by amino acid sxibst itution , deletion , or 
insertion; and that such variants are useful within the 
present invention. In general, it is preferred that any 
engineered variants comprise only a limited number of 
amino acid substitutions, deletions, or insertions, and 

20 that any substitutions are conservative. Thus, it is 
preferred to produce fibrinogen chain polypeptides that 
are at least 90%, preferably at least 95%, and more 
preferably 99% or more identical in sequence to the 
corresponding native chains. The term chain" is meant 

25 to include the alternatively spliced 7' chain of 
fibrinogen (Chung et al., Biochem. 23 : 4232-4236, 1984). 
A human 7* chain amino acid sequence is shown in SEQ ID 
NO: 6. The shorter 7 chain is produced by alternative 
splicing at nucleotides 9511 and 10054 of SEQ ID NO: 5, 

30 resulting in translation terminating after nucleotide 
10065 of SEQ ID NO: 5. . 

To- obtain expression in the m2unmary gland, a 
transcription promoter from a milk protein gene is used. 
Milk protein genes include those genes encoding caseins, 

35 beta-lactoglobulin (BLG) , a-lactalbumin, and whey acidic 
protein. The beta-lactoglobulin promoter is preferred. 
In the case of the ovine beta-lactoglobulin gene, a region 
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Of at least the proximal 406 bp of 5' fl£mklng sequence of 
the ovine BLG gene (contained within nucleotides 3844 to 
4257 of SEQ ID NO: 7) will generally be used. Larger 
portions of the 5* flanking sequence, up to about 5 kbp, 
5 are preferred. A larger DNA segment encompassing the 5* 
flanking promoter region and the region encoding the 5' 
non-coding portion of the beta-lactoglobulin gene 
(contained within nucleotides 1 to 4257 of SEQ ID NO: 7) is 
particularly preferred.^ See Whitelaw et al., Biochsm J, 

10 ?86: 31-39, 1992. Similar fragments of promoter DNA from 
other species are also suitable. 

Other regions of the beta-lactoglobulin gene may 
also be incorporated in constructs, as may genomic regions 
of the gene to be expressed. It is generally accepted in 

15 the art that constructs lacking introns, for example, 
express poorly in comparison with those that contain such 
DNA sequences (see Brinster et al., Proc. Natl. Acad. Sci. 
USA 8S: 836-840, 1988; Palmiter et al., Proc. Natl. Acad. 
ppj. USA M: 478-482, 1991; Whitelaw et al.. Transgenic 

20 Rep, 1: 3-13, 1991; WO 89/01343; WO 91/02318). In this 
regard, it is generally preferred, where possible, to use 
genomic sequences containing all or some of the native 
introns of a gene encoding the protein or polypeptide of 
interest. Within certain embodiments of the invention, 

25 the ftirther incltision of at least some introns from the 
beta-lactoglobulin gene is preferred. One such region is 
a DNA segment which provides for intron splicing and RNA 
polyadenylation from the 3» non-coding region of the ovine 
beta-lactoglobulin gene. When substituted for the natural 

30 3« non-coding sequences of a gene, this ovine beta- 
lactoglobulin segment can both enhance and stabilize 
expression levels of the protein or polypeptide of 
interest. Within other embodiments, the region 

surrounding the initiation ATG of one or more of the 

35 fibrinogen sequences is replaced with corresponding 
sequences from a milk specific protein gene. Such 
replacement provides a putative tissue-specific initiation 
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environment to enhance expression. It is convenient to 
replace the entire fibrinogen chain pre-pro cuid 5* non- 
coding sequences with those of, for example, the BLG gene, 
although smaller regions may be replaced. 
5 For expression of fibrinogen, DNA segments 

encoding each of the three component polypeptide chains of 
fibrinogen are operably linked to additional DNA segments 
required for their expression to produce expression units. 
Such additional segments include the above-mentioned milk 

10 protein gene promoter, as well as sec[uences which provide 
for termination of transcription and polyadenylation of 
mRNA. The expression units will fvurther include a DNA 
segment encoding a secretion signal operably linked to the 
segment encoding the fibrinogen polypeptide chain. The 

15 secretion signal laay be a native fibrinogen secretion 
signal or may be that of another protein, such as a milk 
protein. The term ''secretion signal" is used herein to 
denote that portion of a protein that directs it through 
the secretory pathway of a cell to the outside. Secretion 

20 signals are most commonly found at the amino-termini of 
proteins. See, for example, von Heinje, Nuc. Acids Res> 
li: 4683-4690, 1986; and Meade et al., U.S. Patent No. 
4,873,316, which are incorporated herein by reference. 

Construction of expression units is conveniently 

25 carried out by inserting a fibrinogen chain sequence into 
a plasmid or phage vector containing the additional DNA 
segments, although the expression unit may be constructed 
by essentially any sequence of ligations. It is 
p£urticularly convenient to provide a vector containing a 

30 DNA segment encoding a milk protein and to replace the 
coding sequence for the milk protein with that of a 
fibrinogen chain (including a secretion signal) , thereby 
creating a gene fusion that includes the expression 
control sequences of the milk protein gene. In any event, 

3 5 cloning of the expression units in plasmids or other 
vectors facilitates the amplification of the fibrinogen 
sequences. Amplification is conveniently carried out in 
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bacterial (e.g. E. coli) host cells, thus the vectors will 
typically include an origin of replication and a 
selectable marker functional in bacterial host cells. 

In view of the size of the fibrinogen chain 
5 genes it is most practical to prepare three separate 
expression units, mix them, and introduce the mixture into 
the host. However, those skilled in the art will 
recognize that other protocols may be followed. For 
example, expression units for the three chains can be 

10 introduced individually into different embryos to be 
combined later by breeding. In a third approach, the 
three expression units can be linked in a single suitable 
vector, such as a yeast artificial chromosome or phage Pi 
clone. Coding sequences for two or three chains can be 

15 combined in polycistronic expression units (see, e.g., 
Levinson et al., U.S. Patent No. 4,713,339). 

The expression unit(s) is (are) then introduced 
into fertilized eggs (including early-stage embryos) of 
the chosen host species. Introduction of heterologous DNA 

20 can be accomplished by one of several routes, including 
microinjection (e.g. U.S. Patent No. 4,873,191), 
retroviral infection (Jaenisch, SsisDSS. 2A0: 1468-1474, 
1988) or site-directed integration using embryonic stem 
(ES) cells (reviewed by Bradley et al.. Bio /Technology ^0: 

25 534-539, 1992). The eggs are then implanted into the 
oviducts or uteri of pseudopregnant females and allowed to 
develop to term. Offspring carrying the introduced DNA in 
their germ line can pass the DNA on to their progeny in 
the normal, Mendelian fashion, allowing the development of 

30 transgenic herds. General procedures for producing 
transgenic, animals are known in the art. See, for 
example, Hogan et al.. Manipulat ing the mqus«> RmhT-ynt a 
JLaboratory ManuaJ-r Cold Spring H<urbor Laboratory, 1986; 
Simons et al., Bio/Technolooy 6: 179-183, 1988; Wall et 

35 al., pio;. R^pypd. 1^: 645-651, 1985; Buhler et al., 

pio/yechnpHoqy I: 140-143, 1990; Ebert et al., 
pjo/T^phnQlogy £: 835-838, 1991; Krimpenfort et al., 
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Bio/Technoloav a: 844-847, 1991; Wall et al., J. Cell> 
Biochem, 49; 113-120, 1992; and HIPO publications WO 
88/00239, WO 90/05188, WO 92/11757; and GB 87/00458, which 
are incorporated herein by reference. Techniques for 
5 introducing foreign DNA sequences into mammals and their 
germ cells were originally developed in the mouse. See, 
e.g., Gordon et al., Proc. Natl. Acad. Sci. USA 77: 7380- 
7384, 1980; Gordon and Ruddle, Science 214 ; 1244-1246, 
1981; Palmiter and Brinster, Cell jii 343-345, 1985; 

10 Brinster et al. , Proc. Natl. Acad, Sci, USA 82: 4438-4442, 
1985; and Hogan et al. (ibid.). These technicpies were 
subsequently adapted for use with larger animals, 
including livestock species (see e.g., WIPO publications 
wo 88/00239, WO 90/05188, and WO 92/11757; and Simons et 

15 al., Bio/Technoloav 6: 179-183, 1988). To summarize, in 
the most efficient route used to date in the generation of 
transgenic mice or livestock, several hundred linear 
molecules of the DNA of interest are injected into one of 
the pro-nuclei of a fertilized egg. Injection of DNA into 

20 the cytoplasm of a zygote can also be employed. 

It is preferred to obtain a balanced expression 
of each fibrinogen chain to allow for efficient formation 
of the mature protein. Ideally, the three expression 
units should be on the same DNA molecule for introduction 

25 into eggs. This approach, however, may generate technical 
problems at, for example, the injection and manipulation 
stages. For example, the size of fibrinogen expression 
units may necessitate the use of yeast artificial 
chromosomes (YACs) or phage PI to amplify and memipulate 

30 the DNA prior to injection. If this approach is followed, 
segments of DNA to be injected, containing all three 
expression units, would be very large, thus requiring 
modification of the injection procedure using, for 
example, larger bore needles. In a more simple approach, 

35 a mixture of each individual expression unit is used. It 
is preferred to combine equimolar amotints of the three 
expression units, although those skilled in the art will 
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recognize that this ratio may be varied to compensate for 
the characteristics of a given expression unit* Some 
expression, generally a reduced level, will be obtained 
when lesser molar amounts of one or two chains are used, 
5 and expression efficiencies can generally be expected to 
decline in approximate proportion to the divergence from 
the preferred eguimolar ratio. In any event, it is 
preferred to use a mixture having a ratio of Aa:B^:7 
expression units in the range of 0.5-1: 0.5-1:0. 5-1 • When 

10 the ratio is varied from eguimolar, it is preferred to 
employ relatively more of the expression unit. 

Alternatively, one or a mixture of two of the expression 
xinits is introduced into individual eggs. However, 
animals derived by this approach will express only one or 

15 two fibrinogen chains. To generate an intact fibrinogen 
molecule by this approach requires a subsequent breeding 
program detsigned to combine all three expression units in 
individuals of a group of animals. 

In general, female animals are superovulated by 

20 treatment with follicle stimulating hormone, then mated. 
Fertilized eggs are collected, and the heterologous DNA is 
injected into the eggs using known methods. See, for 
example, tJ.S. Patent No. 4,873,191; Gordon et al., Proc. 
Us£Lz — Acad. Sci. PSA 22: 7380-7384, 1980; Gordon and 

25 Ruddle, Science 214: 1244-1246, 1981; Palmiter and 
Brinster, Cell 41: 343-345, 1985; Brinster et al., Proc. 
Natl. Acad, Sci, PSA 8^: 4438-4442, 1985; Hogan et al.. 
Manipulating the Mouse Ti ^hrvo: A Laboratory Manual . Cold 
Spring Harbor Laboratory, 1986; Simons et al. 

30 Bio/yechy>o^oqY 6: 179-183, 1988; Wall et al., Biol. 
yeprod. 2^: 645-651, 1985; Buhler et al., Bio/TechnolooY 
8: 140-143, 1990; Ebert et al., Bio /Technology ^: 835-838, 
1991; Krimpenfort et al.. Bio /Technology 9: 844-847, 1991; 
Wall et al., J. Cell, Biochem, 49; 113-120, 1992; WIPO 

35 publications WO 88/00239, WO 90/05118, and WO 92/11757; 
and GB 87/00458, which are incorporated herein by 
reference. 
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For injection into fertilized eggs, the 
expression units are removed from their respective vectors 
by digestion with appropriate restriction enzymes. For 
convenience, it is preferred to design the vectors so that 
5 the expression units are removed by cleavage with enzymes 
that do not cut either within the expression units or 
elsewhere in the vectors. The expression units are 
recovered by conventional methods, such as electro-elution 
followed by phenol extraction and ethanol precipitation, 

10 sucrose density gradient centrifugation, or combinations 
of these approaches. 

DMA is injected into eggs essentially as 
described in Hogan et al., ibid. In a typical injection, 
eggs in a dish of an embryo culture medium are located 

15 using a stereo zoom microscope (x50 or x63 magnification 
preferred). Suitable media include Hepes (N-2- 

hydroxyethylpiperazine-N'-2-ethanesulphonic acid) or 
bicarbonate buffered media such as M2 or M16 (available 
from Sigma Chemical Co., St. Louis, USA) or synthetic 

20 oviduct medium (disclosed below). The eggs are secured 
and transferred to the center of a glass slide on an 
injection rig using, for example, a drummond pipette 
coB^lete with capillary tube. Viewing at lower (e.g. x4) 
magnification is used at this stage. Using the holding 

25 pipette of the injection rig, the eggs are positioned 
centrally on the slide. Individual eggs are sequentially 
secured to the holding pipette for injection. For each 
injection process, the holding pipette/ egg is positioned 
in the center of the viewing field. The injection needle 

30 is then positioned directly below the egg. Preferably 
using x40 Nomarski objectives, both manipulator heights 
are adjusted to focus both the egg and the needle. The 
pronuclei are located by rotating the egg and adjusting 
the holding pipette assembly as necessary. once the 

35 pronucleus has been located, the height of the manipulator 
is altered to focus the pronuclear membrane. The 
injection needle is positioned below the egg such that the 
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needle tip is in a position below the center of the 
pronucleus. The position of the needle is then altered 
using the injection manipulator assembly to bring the 
needle and the pronucleus into the seone focal plane. The 
5 needle is moved , via the j oy stick on the in j ect ion 
manipulator assembly, to a position to the right of the 
egg. With a short, continuous jabbing movement, the 
pronuclear membrane is pierced to leave the needle tip 
inside the pronucleus. Pressure is applied to the 

10 injection needle via the glass syringe until the 
pronucleus swells to approximately twice its volume. At 
this point, the needle is slowly removed. Reverting to 
lower (e.g. x4) magnification, the injected egg is moved 
to a different area of the slide, and the process is 

15 repeated with another egg. 

After the DNA is injected, the eggs may be 
cultured to allow the pronuclei to fuse, producing one- 
cell or later stage embryos. In general, the eggs are 
cultured at approximately the body temperature of the 

20 species used in a buffered medium containing balanced 
salts and serum. Surviving embzyos are then transferred 
to pseudopregnant recipient females, typically by 
inserting them into the oviduct or uterus, and allowed to 
develop to term. Dturing embryogenesis, the injected DNA 

25 integrates in a random fashion in the genomes of a small 
number of the developing embryos. 

Potential transgenic offspring are screened via 
blood saxples and/ or tissue biopsies. DNA is prepared 
from these samples and examined for the presence of the 

30 injected construct by technigues such as polymerase chain 
reaction (PCR; see Mullis, U.S. Patent No. 4,683,202) and 
Southern blotting (Southern, J. Mol. Biol. aa:503, 1975; 
Maniatis et al.. Molecula r Cloning; A Laboratory Manu;^ ! ^ 
Cold Spring Harbor Laboratory, 1982) . Founder transgenic 

35 animals, or GOs, may be wholly transgenic, having 
transgenes in all of their cells, or mosaic, having 
transgenes in only a stibset of cells (see, for example, 
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WilJcie et al.. Develop. Biol- lig: 9-I8, 1986). In the 
latter case, groups of germ cells may be wholly or 
partially transgenic. In the latter case, the number of 
transgenic progeny from a founder animal will be less than 
5 the expected 50% predicted from Mendelian principles. 
Foxmder 60 animals are grown to sexual maturity and mated 
to obtain offspring, or Gls. The Gls are also examined 
for the presence of the transgene to demonstrate 
transmission from founder GO animals. In the case of male 

10 GOs, these may be mated with several non-transgenic 
females to generate many offspring. This increases the 
chances of observing transgene transmission. Female GO 
founders may be mated naturally, artificially inseminated 
or superovulated to obtain many eggs which are transferred 

15 to surrogate mothers. The latter course gives the best 
chance of observing transmission in animals having a 
1 imited number of young . The above-described breeding 
procedures are used to obtain animals that can pass the 
DNA on to subsequent generations of offspring in the 

20 normal, Mendelian fashion, allowing the development of, 
for exafl^le, colonies (mice) , flocks (sheep) , or herds 
(pigs, goats and cattle) of transgenic animals. 

The milJc from lactating GO and Gl females is 
examined for the expression of the heterologous protein 

25 using Jjmaunological techniques such as ELISA (see Harlow 
and Lane, Antibodies. A Laboratorv Manual, cold Spring 
Harbor Laboratory, 1988) and Western blotting (Towbin et 
al., Proc. Natl. Acad> Sci. USA 7fi* 4350-4354, 1979). For 
a variety of reasons known in the art, expression levels 

30 of the heterologous protein will be expected to differ 
between individuals. 

A satisfactory family of animals should satisfy 
three criteria: they should be derived from the same 
founder 60 animal; they should exhibit stable transmission 

35 of the transgene; and they should exhibit stable 
expression levels from generation to generation and from 
lactation to lactation of individual animals. These 
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principles have been demonstrated and discussed (Ccirver et 
Bio/Teehnn;|nTY 11: 1263-1270, 1993). Animals from 
such a suitable family are referred to as a "line." 
Initially, male animals, GO or Gl, are used to derive a 
5 flock or herd of producer animals by natural or artificial 
insemination, in this way, many female animals containing 
the same transgene integration event can be quickly 
generated from vhicdi a supply of milk can be obtained. 

The fibrinogen is recovered from milk using 

10 standard practices such as skimming, precipitation, 
filtration and protein chromatography techniques. 

Fibrinogen produced according to the present 
invention is useful within human and veterinary medicine, 
such as in the formulation of s\irgical adhesives. 

15 Adhesives of this type are known in the art. see, for 
example, U.S. Patents No. 4,377,572; 4,442,655; 4,462,567; 
and 4,627,879, which are incorporated herein by reference. 
In general, fibrinogen and factor XIII are combined to 
form a first co]q>onent that is mixed just prior to use 

20 with a second component containing thrombin. The thrombin 
converts the fibrinogen to fibrin, causing the mixture to 
gel, and activates the factor XIII. The activated factor 
XIII cross links the fibrin to strengthen and stabilize 
the adhesive matrix. Such adhesives typically contain 

25 from about 30 mg/ml to about 100 mg/ml fibrinogen and from 
about 50 Aig/ml to about 500 |»g/ml factor XIII. They may 
also contain additional ingredients, such as aprotinin, 
albumin, fibronectin, bulking agents, and solubilizers. 
Methods for producing factor XIII are known in the art. 

30 see, for exainple, U.S. Patent No. 5,204,447. The 
fibrinogen is also useful for coating surfaces of 
polymeric articles, e.g. synthetic vascular grafts, as 
disclosed in U.S. Patent No. 5,272,074 (incorporated 
herein by reference) . 

35 The invention is further illustrated by the 

following non-limiting examples. 
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Example I 

The multiple cloning site of the vector pUClB 
5 (Yanisch-Perron et al,. Gene 32:103-119, 1985) vas removed 
and replaced with a synthetic doxible stranded 
oligonucleotide (the strands of which are shown in SEQ ID 
NO: 8 and SEQ ID NO: 27) containing the restriction sites 
Pvu I/Mlu I/Eco RV/Xba I/Pvu I/Mlu I, and flanked by 5» 

10 overhangs compatible with the restriction sites Eco RI and 
Hind III. pUC18 was cleaved with both Eco RI and Hind 
III, the 5» terminal phosphate groups were removed with 
calf intestinal phophastase, and the oligonucleotide was 
ligated into the vector backbone. The DNA sequence across 

15 the junction was confirmed by secpiencing, and the new 
plasmid was called pUCPM. 

The ^-lactoglobulin (BLG) gene sec[uences from 
pSSltgXS (disclosed in WIPO publication WO 88/00239) were 
excised as a Sal I-Xba I fragment and recloned into the 

20 vector pUCPM that had been cut with Sal I and Xba I to 
construct vector pUCXS. pUCXS is thus a pUC18 derivative 
containing the entire BLG gene from the Sal I site to the 
Xba I site of phage SSI (Ali and Clark, J. Mol> Biol. 199 : 
415-426, 1988). 

25 The plasmid pSSltgSE (disclosed in WIPO 

publication WO 88/00239) contains a 1290 bp BLG fragment 
flanked by Sph I and EcoR I restriction sites, a region 
spanning a unique Not I site and a single Pvu II site 
which lies in the 5* untranslated leader of the BLG mRNA. 

30 Into this Pvu II site was ligated a double stremded, 8 bp 
DNA linker (5'-GGATAtCC-3') encoding the recognition site 
for the enzyme Eco RV. This plasmid was called 
pSSltgS£/RV. DNA sequences bounded by Sph I and Not I 
restriction sites in pSSltgSE/RV were excised by enzymatic 

35 digestion and used to replace the equivalent fragment in 
pUCXS. The resulting plasmid was called pUCXSRV. The 
sequence of the BLG insert in pUCSXRV is shown in SEQ ID 
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NO: 1, with the unique Eco RV site at nucleotide 4245 in 
the 5" untranslated leader region of the BLG gene. This 
site allows insertion of any additional DNA sequences 
under the control of the BLG promoter 3 • to the 
5 transcription initiation site. 

"iJsing the primers BLGANP3 (5»-TGG ATC CCC TGC 
CGG TGC CTC TGG-3»; SBQ ID NO: 9) and BLGAMP4 (5»-AAC 6CG 
TCA TCC TCT 6TG AGC CAG-3 • ; SEQ ID NO: 10) a PGR fragment 
Of approximately 650 bp was produced from sequances 

10 immediately 3» to the stop codon of the BLG gene in 
pUCXSRV. The PCR fragment was engineered to have a BamH I 
site at its 5* end and an Mlu I site at its 3» end and was 
cloned as such into BamH I and Mlu I cut pGEM7zf(+) 
(Promega) to give pDAN200(+). 

3^5 pUCXSRV was digested with Kpn I, and the 

largest, vector containing band was gel purified. This 
band contained the entire pUC plasmid sequences and some 
3» non*coding secpiences from the BLG gene. Into this 
backbone was ligated the small Kpn I fragment from 

20 pDAM200(+) which, in the correct orientation, effectively 
engineered a BamH I site at the extreme 5» end of the 2.6 
Kbp of the BLG 3» flanking region. This plasmid was 
called PBLAC200. A 2.6 Kbp Cla I-Xba I fragment from 
PBLAC200 was ligated into Cla I-Xba I cut pSP72 vector 

25 (Promega) , thus placing an EcoR V site immediately 
upstream of the BLG sequences. This plasmid was called 
PBLAC210. 

The 2.6 Kbp Eco RV-Xba I fragment from pBLAC210 
was ligated into Eco RV-Xba I cut pUCXSRV to form pMAD6. 

30 This, in effect, excised all coding and intron sequences 
from pUCXSRV, forming a BLG minigene consisting of 4.3 Kbp 
of 5» promoter and 2.6 Kbp of 3» downstream sequences 
flanking a unique EcoR V site. An oligonucleotide linker 
(ZC6839: ACTACGTA6T; SEQ ID NO: 11) was inserted into the 

35 Eco RV site of pMAD6. This modification destroyed the Eco 
RV site and created a Sna BI site to be used for cloning 
purposes. The vector was designated pMAD6-Sna. Messenger 
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RNA initiates upstrecun of the Sna BI site and terminates 
do%m8tream of the Sna BI site. The precursor transcxipt 
will encode a single Bl/^-derived intron, intron 6, which 
is entirely within the 3* untranslated region of the gene. 

5 

Example II 

Clones encoding the individual fibrinogen chains 
were obtained from the laboratory of Dr. Earl W. Davie, 

10 University of Washington, Seattle. A genomic fibrinogen 
Aik^chain clone (Chung et al., 1990, ibid.) was obtained 
from the plasmid BS4. This plasmid contains the Aa clone 
inserted into the Sal I and Bam HI sites of the vector 
pUC18, but lacks the coding sequence for the first foxir 

15 amino acids of the Aa chain. A genomic B^-chain DNA {Chung 
et al., ibid.) was isolated from a lambda Cheuron 4A phage 
clone (designated as two EcoRI fragments of ca. 5.6 

Kbp each. The two fragments were cloned separately into 
pUC19 that had been digested with Eco RI and treated with 

20 calf intestinal phosphatase. The resulting clones were 
screened by digestion with the restriction enzyme Pvu II 
to distinguish plasmids with the 5* and 3' B^ inserts 
(designated Betas *RI/puc and Beta3 'RI/puc, respectively). 
Genomic 7-chain clones were isolated as described by Rixon 

25 et al. f Biochemistrv 24: 2077-2086, 1985). Clone pT12A9 
comprises 5» non-coding sequences and approximately 4535 
bp of 7-chain coding sequence. Clone p712F3 comprises the 
remaining coding sequence and 3* non-coding nucleotides. 
Both are pBR322-based plasmids with the fibrinogen 

30 sequences inserted at the EcoRI site. These plasmids were 
used as templates for the respective PCR reactions. 

The fibrinogen chain coding sec[uences were 
tailored for insertion into expression vectors using the 
polymerase chain reaction (PCR) as generally described by 

35 Mullis (U.S. Patent No. 4,683,202). This procedure 
removed native 5* and 3* untranslated sequences, added a 9 
base sequence (CCT GCA GCC) upstream of the first ATG of 
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each coding sequence, supplied the first four codons for 
the Afl-chain sequence, removed an internal Mlu I site in 
the Aft sequence and added restriction sites to facilitate 
subsequent cloning steps. 
5 Referring to Figure l, the 5* end of the Aft 

coding sequence was tailored in a PC» reaction containing 
20 pmole for each of primers ZC6632 (SBQ ID NO: 12) and 
ZC6627 (SEQ ID NO: 13), approximately 10 ng of plasmid BS4 
template DNA, 10 >1 of a mix containing 2.5 mM each dNTP, 

10 7.5 111 lox Pyrococcus furiosus (Pfu) DNA polymerase buffer 
#1 (200 mM Tris-HCl, pH 8.2, 100 mM KCl, 60 mM (NH4)2S04, 
20 mM MgCl2, 1% Triton X-lOO, lOO /ig/ml nuclease free 
bovine serum albumin) (Stratagene, La Jolla, CA) , and water 
to 75 111. The mixture was heated to 940C in a DNA thermal 

15 cycler (Perlcin-Elmer Corp., Norwalk, CT) . To the heated 
mixture was added 25 ill of a mixture containing 2.5 ^1 lOx 
Pfu buffer /I, 22 ill H2O and 1 ill 2.5 units/Ml Pfu DNA 
polymerase (Stratagene) . The reactions were run in a DNA 
thermal cycler (Perkin-Elmer) for five cycles of 94«>, 45 

20 seconds; 4p«>, 90 seconds; 72«, 120 seconds; 20 cycles of 
94«, 45 seconds; 45", 90 seconds; 72«, 120 seconds; then 
incubated at 72» for 7 minutes. The 5« PCR-generated 
fragment was digested with Bam HI and Hind III, and the 
Bam HI>Hind III fragment was then ligated to an internal 

25 2.91 Kbp Hind Ill-Xba I fragment and Bam HI, Xba I- 
digested pUC18. PCR-generated exon sequences were 
sequenced. 

Referring again to Figure 1, the 3* end of the 
Aft coding sequence was tailored in a series of steps in 

30 which the Mlu I site 563 bases upstream from the stop 
codon of the Aa sequence was mutated using an overlap 
extension PCR reaction (Ho et al.. Gene 22: 51-59, 1989). 
In the first reaction 40 pmole of each of primers ZC6521 
(SEQ ID MO: 14) and ZC6520 (SEQ ID NO: 15) were combined 

35 with approximately 10 ng of plasmid BS4 template DNA in a 
reaction mixture as described above. The reaction was run 
for 5 cycles of 940, 45 seconds; 40°, 60 seconds; 720, 120 
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seconds; 15 cycles of 94^, 45 seconds; 45<>, 60 seconds; 
72®, 120 seconds; then incubated at 72® for 7 minutes. A 
second reaction was carried out in the same manner using 
40 pmole of each of primers ZC6519 (SEQ ID NO: 16) and 
5 2C6518 (SEQ ID NO: 17) and BS4 as template. The PCR- 
generated DNA fragments from the first and second 
reactions were isolated by gel electrophoresis and elution 
from the gel. Approximately 1/10 of each recovered 
reaction product was combined with 40 pmole of each of 

10 primers ZC6521 (SEQ ID NO: 14) and ZC6518 (S^ ID NO: 17) 
in a PGR reaction in which the complementary 3» ends of 
each fragment (containing the single base change) annealed 
and served as a primer for the 3 • extension of the 
complementary strand. PGR was carried out using the same 

15 reaction conditions as in the first and second 3 » PGR 
steps. The reaction product was then digested with Xba I 
and Bam HI, and the Xba I-Bam HI fragment was cloned into 
Xba I, Bam Hl-digested pUC18. PCR-generated exons were 
sequenced. 

20 As shown in Figure 1, the 5« Bam Hl-Xba I 

fragment (3.9 Kbp) and the 3" Xba I-Bam HI fragment (1.3 
Kbp) were inserted into the Bam HI site of the vector 
Zem228. Zem228 is a pUC18 derivative comprising a Bam HI 
cloning site between a mouse MT-1 promoter and SV40 

25 terminator, and a neomycin resistance marker flanked by 
SV40 promoter and terminator sequences. See European 
Patent Office Publication EP 319,944 and Fig. 2. The 
entire Aa coding sequence was isolated from the Zem228 
vector as an Sna BI fragment, which was inserted into the 

30 Sna BI site of the plasmid pliAD6-Sna. 

Referring to Fig. 3, the 5* end of the B^-chain 
was tailored by PGR using the oligonucleotides ZC6629 (SEQ 
ID NO: 18), ZG6630 (SEQ ID NO: 19) and ZG6625 (SEQ ID NO: 
20) . These primers were used in pairwise combinations 

35 (ZG6629 + ZG6625 or ZG6630 + ZG6625) to generate B^ coding 
sequences beginning at the first ATG codon (position 470 
in SEQ ID NO: 3) (designated Nl-Beta) or the third ATG 
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codon (position 512 in SEQ ID NO: 3) (designated N3-Beta) . 
Approximately 5 ng of Betas • RI/puc template DNA was 
combined with 20 pmole of each of the primers (Nl- 
Beta:ZC6629, SEQ ID NO: 18 + ZC6625, SEQ ID NO: 20; or N3- 
5 Beta:ZC6630, SEQ ID NO: 19 + ZC6625, SEQ ID NO: 20) in a 
reaction mixture as described above. The mixtures were 
incubated for 5 cycles of 94^, 45 seconds; 40^, 120 
seconds; (Nl-Beta) or 90 seconds (N3-Beta) ; 72^, 120 
seconds; 20 cycles of 94®, 45 seconds; 45^, 120 seconds; 

10 (Nl'Beta) or 90 seconds (N3-Beta) ; 72^, 120 second?; then 
incubated at 72® for 7 minutes. The two reaction products 
Nl, 555 1:^9 or N3, 510 bp) were each digested with Eco RI 
and Bgl II, and the fragments were ligated to the internal 
Bgl Il-Xba I fragment and Eco RI + Xba I-digested pUC19. 

15 The 3' end of the B^ sequence was tailored in a reaction 
mixture as described above using the oligonucleotide 
primers ZC6626 (SEQ ID NO: 21) and ZC6624 (SEQ ID NO: 22) 
and approximately 5 ng of Beta3 'RI/puc template. The 
mixtures were incubated for 5 cycles of 94^, 45 seconds; 

20 400, 90 seconds; 72^, 120 seconds; 15 cycles of 94^, 45 
seconds; 45®, 90 seconds; 72^, 12 0 seconds; then incubated 
at 72® for 7 minutes. A 990 bp Bgl Il-Eco RI fragment was 
isolated. This 3' fragment was ligated to the adjacent 
coding fragment (340 bp, Sphl-Bgl II) and Sph I + Eco RI- 

25 digested pUC19. The 3* and 5» PCR-generated exons were 
sequenced. A third intermediate vector was constructed by 
combining two internal fragments (4285 bp Xba I-Eco RI and 
383 kb Eco Rl-Sph I) in Xba I + Sph I-digested pUC19. The 
entire B^ coding secpience (two forms) was then assembled 

30 by ligating one of the 5» Eco Rl-Xba I fragments, the. 
internal Xba I-Sph I fragment, the 3» Sph I-Eco RI 
fragment and Eco Rl-digested vector pUC19. The B^ 
sequence %ras then isolated as a 7.6 Kbp Sna BI fragment 
and insexrted into the Sna BI site of pHAD6-Sna. 

35 Referring to Pig. 4, the 5* end of the gamma 

chain sequence was tailored by PGR using the 
oligonucleotide primers ZC6514 (SEQ ID NO: 23) and ZC6517 
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(SEQ ID NO: 24) and approximately 50 ng of p7l2A9 as 
template. The PGR reaction was run as described above 
using 40 pM of each primer. The reaction was run for 5 
cycles of 94", 45 seconds; 40», 60 seconds, 72», 120 
5 seconds, followed by 15 cycles of 94», 45 seconds; 45«>, 60 
seconds; 72«>, 120 seconds. The resulting 213 bp fragment 
was digested with Bam HI and Spe I, and the resulting 
restriction fragment was ligated with the adjacent 
downstream 4.4 Icb Spe I-Eco RI fragment and Bam HI + Eco 

10 HI digested pUCl9. The 3« end of the gamma chain sequence 
was tailored using oligonucleotide primers ZC6516 (SEQ ID 
HO: 25) and ZC6515 (SEQ ID NO: 26) using 40 pM of each 
primer, approximately 50 ng of pTrl2F3 template and the 
same thermal cycling schedule as used for the 5» fragment. 

15 The resulting 500 bp fragment was digested with Spe I and 
Bam HI, and the resulting restriction fragment was ligated 
with the upstream 2.77 Jcb Eco Rl-Spe I fragment and Eco RI 
+ Bam Hl-digested pDC19. All PCR-generated exons were 
sequenced. The entire 7 '-chain coding sequence was then 

20 assembled by ligating a 4.5 Kbp Bam HI -Eco RI 5' fragment, 
a 1.1 Kbp Eco Ri-pst I internal fragment and a 2.14 Kbp 
Pst I-Xba I 3» fragment in Bam HI + Xba I-digested 
Zem219b. Zem219b is a pUC18-derived vector containing a 
mouse metallothionein promoter and a DHFR selectable 

25 marker operably linked to an SV40 promoter (Fig. 5). 
Plasmid Zem219b has been deposited with American Type 
Culture Collection as an E. coli XLl-blue transformant 
under Accession No. 68979. The entire 7 •-chain coding 
sequence was then isolated as a 7.8 Kbp Sna Bl fragment 

30 and inserted into the Sna BI site of pMAD6-Sna. 

Example III 

Mice for initial breeding stocks (C57BL6J, 
CBACA) were obtained from Harlan Olac Ltd. (Bicester, UK) . 
35 These were mated in pairs to produce Pi hybrid cross 
(B6CBAF1) for recipient female, superovulated females, 
stud males and vasectomized males. All animals were kept 
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on a 14 hour light/ 10 hour dark cycle and fed water and 
food (Special Diet Searvices RM3^ Edinbxirgh, Scotland) ad 
libitum. 

Transgenic mice were generated essentially as 
5 described in Hogan et al., Manipulating the MougA ^ Hryr>> 
A ^^^ratpyy Maq^ua^^ Cold Spring Harbor Laboratory, 1986, 
vhicdi is incorporated herein by reference in its entirety. 
Female B6CBAF1 animals were superovulated at 4-5 weeks of 
age by an i . p . in j ect ion of pregnant mares • serum 

10 gonadotrophin (FOLLIGON, Vet-Drug, Falkirk, Scotland) (5 
iu) followed by an i.p. injection of human chorionic 
gonadotrophin (CHORXJLON, Vet-Drug, Falkirk, Scotland) (5 
iu) 45 hours later. They were then mated with a stud male 
overnight. Such females were next examined for copulation 

15 plugs. Those that had mated were sacrificed, and their 
eggs were collected for microinjection. 

DMA was injected into the fertilized eggs as 
described in Hogan et al. (ibid.) Briefly, each of the 
vectors containing the Ao, and 7 expression units was 

20 digested with Mlu I, and the expression units were 
isolated by sucrose gradient centrifugation. All 
chemicals used were reagent grade (Sigma Chemical Co., St. 
Louis, MO, U.S.A.), and all solutions were sterile and 
nuclease-free. Solutions of 20% and 40% sucrose in 1 M 

25 MaCl, 20 jM Tris pH 8.0, 5 mM EDTA were prepared using DHP 
water and filter sterilized. A 30% sucrose solution was 
prepared by mixing equal volumes of the 20% and 40% 
solutions. A gradient was prepared by layering 0.5 ml 
steps of the 40%, 30% and 20% sucrose solutions into a 2 

30 ml polyallomer tube and allowed to stand for one hour. 
100 111 of DMA solution (max. 8 iig DMA) was loaded onto the 
top of the gradient, and the gradient was centrifuged for 
17-20 hours at 26,000 rpm, IS^'C in a Beckman TLIOO 
ultracentrifuge using a TLS-55 rotor (Beckman Instnuaents, 

35 Fullerton, CA, USA) . Gradients were fractionated by 
puncturing the tube bottom with a 20 ga. needle and 
collecting drops in a 96 well microtiter plate. 3 ^1 
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aliguots were= analyzed on a 1% agarose mini-gel. 
Fractions containing the desired DNA fragment were pooled 
and ethanol precipitated overnight at -20»C in 0.3M sodium 
acetate. DNA pellets were resuspended in 50-100 nl OHP 
5 water and quantitated by fluorimetry. The expression 
units were diluted in Dulbecco's phosphate buffered saline 
without calcium and magnesium (containing, per liter, 0.2 
g KCl, 0.2 g KH2PO4, 8.0 g NaCl, 1.15 g Na2HP04) , mixed 
(using either the Nl-Beta or N3-Beta expression xmit) in a 
10 1:1:1 molar ratio, concentration adjusted to atout 6 
liq/ml, and injected into the eggs (-2 pi total DNA 
solutxon per egg) . 

Recipient females of 6-8 weeks of age are 
prepared by mating B6C8AF1 females in natural estrus with 
15 vasectcmized males. Females possessing copulation plugs 
are then kept for transfer of microinjected eggs. 

Following birth of potential transgenic animals, 
tail biopsies are taken, under anesthesia, at four weeks 
of age. Tissue samples are placed in 2 ml of tall buffer 
20 (0.3 M Na acetate, 50 nM HCl, 1.5 aM MgCl2, 10 mH Tris- 
HCl, pH 8.5, 0.5% NP40, 0.5% Tween 20) containing 200 
/«g/ml proteinase K (Boehringer Mannheim, Mannheim, 
Germany) and vortexed. The samples are shaken (250 rpm) 
at 55'»-60«» for 3 hours to overnight. DNA prepared from 
25 biopsy samples is examined for the presence of the 
injected constructs by PGR and Southern blotting. The 
digested tissue is vigorously vortexed, and 5 ill aliquots 
are placed in 0.5 ml microcentrifuge tubes. Positive and 
negative tail sauries are included as controls. Forty /il 
30 of silicone oil (BDH, Poole, UK) is added to each tube, 
and the tubes are briefly centrifuged. The tubes are 
incubated in the heating block of a thermal cycler (e.g. 
Omni-gene, Hybaid, Teddington, UK) to 95«C for 10 minutes. 
Following this, each tube has a 45 ill aliquot of PCR. mix 
35 added such that the final composition of each reaction mix 
is: 50 mM KCl; 2 jM MgCl2; 1° Tris-HCl (pH 8.3); 0.01% 
gelatin; 0.1% NP40, 10% DMSO; 500 nM each primer, 200 im 
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dNTPs; 0.02 U/^1 Taq polymerase (Boehringer Mannheim, 
Mannheim, Germany) . The tubes are then cycled through 30 
repeated temperature changes as required by the particular 
primers used. The primers may be varied but in all cases 
5 must target the Bix; promoter region. This is specific for 
the injected DMA fragments because the mouse does not have 
a BI/S gene. Twelve ill of 5x loading buffer containing 
Orange 6 marker dye (0.25% Orange G [Sigma] 15% Ficoll 
type 400 [Pharmacia Biosystems Ltd.^ Milton Keynes, UK]) 

10 is then added to each tube, and the reaction mixtures are 
electrophoresed on a 1.6% agarose gel containing ethidium 
bromide (Sigma) until the maurker dye has migrated 2/3 of 
the length of the gel. The gel is visualized with a UV 
light source emitting a wavelength of 254 nm. Transgenic 

15 mice having one or more of the injected DNA fragments are 
identified by this approach. 

Positive tail samples are processed to obtain 
pure DNA . The DNA samples are screened by Southern 
blotting using a BLG promoter probe (nucleotides 2523-4253 

20 of SEQ ID NO: 7). Specific cleavages with appropriate 
restriction enzymes (e.g. Eco RI) allow the distinction of 
the three constructs containing the Ao, Bfi emd y sequences. 

Southern blot analysis of transgenic mice 
prepared essentially as described £dx>ve demonstrated that 

25 more than 50% of progeny contained all three fibrinogen 
sequences. Examination of milk from positive animals by 
reducing SDS polyacrylamide gel electrophoresis 
demonstrated the presence of all three protein chains at 
concentrations up to 1 mg/ml. The amovmt of fully 

30 assembled fibrinogen was related to the ratios of 
individual subunits present in the milk. No apparent 
phenotype was associated with high concentrations of human 
fibrinogen in mouse milk. 

35 Example IV 

Donor ewes are treated with an intravaginal 
progesterone-impregnated sponge (CHRONOGEST Goat Sponge, 
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Intervet, Cambridge, DK) on day 0. Sponges are left in 
situ for ten or twelve days. 

Superovulation is induced by treatment of donor 
eves with a total of one unit of ovine follicle 
5 stimulating hormone (OFSH) (0VA6EN, Horizon Animal 
^Reproduction l-echnology Pty. Ltd., New Zealand) 
administered in eight intramuscular injections of 0.125 
units per injection starting at 5:00 pm on day -4 and 
ending at 8:00 am on day 0. Donors are injected 

10 intramuscularly with 0.5 ml of a luteolytic agent 
(ESTRUMATE, Vet-Drug) on day -4 to cause regression of the 
corpus luteum, to allow return to estrus and ovulation. 
To synchronize ovulation, the donor animals avB injected 
intramuscularly with 2 ml of a synthetic releasing hormone 

15 analog (RECEPTAL, Vet-Drug) at 5:00 pm on day 0. 

Donors are starved of food and water for at 
least 12 hours before artificial insemination (A.I.). The 
animals are artificially inseninated by intrauterine 
laparoscopy under sedation and local anesthesia on day 1. 

20 Either xylazine (ROMPDN, Vet-Drug) at a dose rate of 0.05- 

0. 1 ml per 10 kg bodyweight or ACP injection 10 mg/ml 
(Vet-Drug) at a dose rate of 0.1 ml per 10 kg bodyweight 
is injected intramuscularly approximately fifteen minutes 
before A.I. to provide sedation. A.I. is carried out 

25 using freshly collected semen from a Poll Dorset ram. 
Semen is diluted with equal parts of filtered phosphate 
buffered saline, and 0.2 ml of the diluted semen is 
injected per uterine horn. Immediately pre- or post-A.I., 
donors are given an intramuscular injection of AMOXYPEN 

30 (Vet-Drug) . 

Fertilized eggs are recovered on day 2 following 
starvation of donors of food and water from 5:00 pm on day 

1. Recovery is carried out under general anesthesia 
induced by an intravenous injection of 5% thiopentone 

35 sodium (INTRAVAL SODHJM, Vet-Drug) at a dose rate of 3 ml 
per 10 kg bodyweight. Anesthesia is maintained by 
inhalation of 1-2% Halothane/02/N20 after intubation. To 
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recover the fertilized eggs, a laparotomy incision is 
made, and the uterus is exteriorized. The eggs are 
recovered by retrograde flushing of the oviducts with Ovum 
Culture Medium (Advanced Protein Products, Brierly Hill, 
5 West Midlands, DK) supplemented with bovine serum albumin 
of New Zealand origin. After flushing, the uterus is 
returned to the abdomen, and the incision is closed. 
Donors are allowed to recover post-operatively or are 
euthanized. Donors that are allowed to recover are given 

10 an intramusculeu: in j ection of Amoxypen L . A . at the 
manufacturer's recommended dose rate immediately pre- or 
post^operatively . 

Plasmids containing the three fibrinogen chain 
expression units are digested with Mlu I, and the 

15 expression unit fragments are recovered and purified on 
sucrose density gradients. The fragment concentrations 
are determined by fluorimetry and diluted in Dulbecco*s 
phosphate buffered saline without calcium and magnesium as 
described above. The concentration is adjusted to 6 lig/ml 

20 and approximately 2 pi of the mixture is microinjected 
into one pronucleus of each fertilized eggs with visible 
pronuclei. 

All fertilized eggs sxirviving pronuclear 
microinjection are cultured in vitro at 38.5«>c in an 

25 atmosphere of 5% C02:5% 02:90% N2 and about -100% humidity 
in a bicarbonate buffered synthetic oviduct mediim (see 
Table) supplemented with 20% v/v vasectomized ram serum. 
The serum may be heat inactivated at 56^C for 30 minutes 
and stored frozen at ^2(PC prior to use. The fertilized 

30 eggs are cultured for a suitable period of time to allow 
early embryo mortality (caused by the manipulation 
techniques) to occur. These dead or arrested embryos are 
discarded. Embryos having developed to 5 or 6 cell 
divisions are transferred to synchronized recipient ewes. 
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Table 

Synthetic Oviduct Medium 



Stock A fLasts 3 Months^ 

NaCl 

KCl 

KH2PO4 

M9SO4.7H9O 

Penicillin 

Sodium Lactate 60% syrup 
Super H2O 



6.29 g 
0.534 9 
0.162 9 
0.182 9 
0.06 g 
0.6 mis 
99.4 mis 



Stock B fMpts 


2 weeks) 






NaHC03 






0.21 g 


Phenol red 






0.001 


g 


Super H2O 






10 mis 




Stock C r Lasts 


2 


weeks) 






Sodium Pyruvate 




0.051 


g 


Super H2O 






10 mis 




Stock D fLasts 


3 


months) 






CaC12.2H20 






0.262 


g 


Super H2O 






10 mis 




Stock E fLasts 


3 


months) 






Hepes 






0.651 


9 


Phenol red 






0.001 


g 



Super H2O 



10 mis 



To make up lOmls of Bicarb onate Buffered 
Medium 

STOCK A 1 ml 

STOCK B 1 ml 
STOCK C 0.07 ml 

STOCK D 0.1 ml 

Super H2O 7.83 ml 

Osmolarity should be 265-285 mOsm. 

Add 2.5 ml of heat inactivated sheep seriim 

and filter sterilize. 

To make up 10 mis of HEPES Buffered Medium 

STOCK A 1 ml 

STOCK B 0.2 ml 

STOCK C 0.07 ml 

STOCK D 0.1 ml 

STOCK £ 0.8 ml 

Super H20 7.83 ml 
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Table, conti. 

Osmolarity should be 265-285 mOsm. 
5 Add 2.5 ml of heat Inactivated sheep serum 

and filter sterilize. 

Recipient ewes are treated with an intravaginal 
progesterone-iiopregiiated sponge (Chronogest Ewe Sponge or 

10 Chronogest Ewe-Lamb Sponge, Intervet) left in situ for 10 
or 12 days. The ewes are injected intramuscularly with 
1.5 ml (300 iu) of a follicle stimulating hormone 
substitute (P.N.S.6. , Intervet) and with 0.5 ml of a 
luteolytic agent (Estrumate, Coopers Pitman-Moore) at 

15 sponge removal on day -1. The ewes are tested for es^^ls 
with a vasectomized ram between 8:00 am and 5:00 pm on 
days 0 and 1. 

Embryos surviving in vitro culture are returned 
to recipients (starved from 5:00 pm on day 5 or 6) on day 

20 6 or 7. Embryo transfer is carried out under general 
anesthesia as described above. The uterus is exteriorized 
via a laparotomy incision with or without lapsiroscopy. 
Embryos are returned to one or both uterine horns only in 
ewes with at least one suitable corpora lutea. After 

25 replacement of the uterus, the abdomen is closed, and the 
recipients are allowed to recover. The animals are given 
an intramuscular injection of Amoxypen L.A. at the 
manufacturer's recommended dose rate immediately pre- or 
post-operat ively . 

30 Lambs are identified by ear tags and left with 

their dams for rearing. Ewes and lambs are either housed 
and fed complete diet concentrates and other supplements 
and or ad lib. hay, or are let out to grass. 

Within the first week of life (or as soon 

35 thereafter as possible without prejudicing health) , each 
lamb is tested for the presence of the heterologous DNA by 
two sampling procedures. A 10 ml blood sample is taken 
from the jugular vein into an EDTA vacutainer. If fit 
enough, the lambs also have a second 10 ml blood sample 
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taken within one week of the first. Tissue samples are 
taken by tail biopsy as soon as possible after the tail 
has become desensitized after the application of a rubber 
elastrator ring to its proximal third (usually within 200 
5 minutes after tailing") • The tissue is placed 

immediately in a solution of tail buffer. Tail samples 
are kept at room tosqderature and analyzed on the day of 
collection. All lambs are given an intramuscular 
injection of Amoxypen L.A. at the manufacturer's 

10 recommended dose rate immediately post-biopsy, and the cut 
end of the tail is sprayed with an antibiotic spray. 

DNA is extracted from sheep blood by first 
separating white blood cells. A 10 ml sample of blood is 
diluted in 20 ml of Hank's buffered saline (HBS; obtained 

15 from Sigma Chemical Co.). Ten ml of the diluted blood is 
layered over 5 ml of Histopaque (Sigma) in each of two 15 
ml screw-capped tubes. The tubes are centrifuged at 3000 
rpm (2000 X g max.), low brake for 15 minutes at room 
temperature. White cell interfaces are removed to a clean 

20 15 ml tube and diluted to 15 ml in HBS. The diluted cells 
are spun at 3000 rpm for 10 minutes at room temperature, 
and the cell pellet is recovered and resuspended in 2-5 ml 
of tail buffer. 

To extract DNA from the white cells, 10% SDS is 

25 added to the resuspended cells to a final concentration of 
1%, and the tube is inverted to mix the solution. One mg 
of fresh proteinase K solution is added, £md the mixtvire 
is incubated overnight at 45»C. DNA is extracted using an 
equal volume of phenol/chloroform (x3) and 

30 chloroform/ isoamyl alcohol (xl) . The DNA is then 
precipitated by adding 0 . 1 volume of 3 M NaOAc and 2 
volumes of ethemol, and the tube is inverted to mix. The 
precipitated DNA is spooled out using a clean glass rod 
with a sealed end. The spool is washed in 70% ethanol, 

35 and the DNA is allowed to partially dry , then is 
redissolved in TE (10 mM Tris-HCl, 1 mM EDTA, pH 7.4). 
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DNA sanqples from blood and tail are analyzed by 
Southern blotting using probes for the BLG promoter region 
and the fibrinogen chain coding regions. 

Prom the foregoing, it will be appreciated that, 
5 although specific embodiments of the invention have been 
described herein for purposes of illustration, various 
modifications may be made without deviating from the 
spirit and scope of the invention. Accordingly, the 
invention is not limited except as by the appended claims. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: ZymoGenetics, Inc. 

1201 Eastlake Avenue East 
Seattle, Washington 98102 
United States of America 

Pharmaceutical Proteins Ltd. 

Roslln 

Edinburgh 

Midlothian, Scotland EH25 9PP 

(11) TITLE OF INVENTION: Production of Fibrinogen in Transgenic 
Animals 

(ill) NUMBER OF SEQUENCES: 27 

(1v) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: ZymoGenetics, Inc. 

(B) STREET: 1201 Eastlake Avenue East 

(C) CITY: Seattle 

(D) STATE: WA 

(E) COUNTRY: USA 

(F) ZIP: 98102 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Hoppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) AHORNEY/AGENT INFORMATION: 

(A) NAME: Parker, Gary E 

(B) REGISTRATION NUMBER: 31-648 

(C) REFERENCE/DOCKET NUMBER: 93-15PC 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 206-442-6673 

(B) TELEFAX: 206-442-6678 
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(2) INFORMATION FOR SEQ ID N0:1: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5943 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(11) KOLECULE TYPE: DNA (genomic) 



(vll) IMMEDIATE SOURCE: 

(B) CLONE: Hionan Fibrinogen A-alpha chain 

(1x) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: jo1n(31..84, 1154. .1279. 1739. .1922, 3055.. 3200, 

3786.. 5210) 



(Xl) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

GTCTAGGAGC CAGCCCCACC CTTAGAAAAG ATG TIT TCC ATG AGG ATC GTC TGC 54 

Met Phe Ser Met Arg He Val Cys 
1 5 

CTA GTT CTA AGT GTG GTG GGC ACA GCA TGG GTATGGCCCT TTTCATTTTT 104 
Leu Val Leu Ser Val Val Gly Thr Ala Trp 
10 15 

TCTTCHGCT TTCTCTCTGG TGTTTAHCC ACAAAGAGCC TG6AGGTCAG AGTCTACCTG 164 

aCTATGTCC TGACACACTC TTAGCTTTAT GACCCCAGGC CTGGGAGGAA ATTTCCTGGG 224 

TGGGCTTGAC ACCTCAAGAA TACAGGGTAA TATGACACCA AGAGGAAGAT CTTAGATGGA 284 

TGAGAGTGTA CAACTACAAG GGAAACTTTA GCATCTGTCA HCAGTCTTA CCACATTTTG 344 

TTrrGTTTTG TTTTAAAAAG GGCAAGAATT ATTTGCCATC CHGTACCTA TAAAGCCHG 404 

GTGCAHATA ATGCTAGTTA ATGGAATAAA ACATTTTATG GTAAGATTTG TTTTCTTTAG 464 

TTATTAATTr CHGCTACH GTCCATAATA AGCAGAACH HAGTGHAG TACAGTTTTG 524 

CTGAAAGGH AHGHGTGT TT6TCAAGAC AGAAGAAAAA GCAAACGAAT TATCTHGGA 584 

AATATCTTTG CAGTATCAGA AGAGAHAGT TAGTAAGGCA ATACGCTTTT CCGCAGTAAT 644 
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GGTAnCTTT TAAAHATfiA ATCCATaCT AAAGGHACA TAGAAACTTG AAGGAGA6AG 704 

GAACAHCAG TTAAGATAGT CTAGGTTTn CTACTGAAGC AGCAAHACA 6GAGAAA6AG 764 

CTCTACAGTA GTmCAACT TTCT6TCTGC AGTCAHAGT AAAAATGAAA AGGTAAAAH 824 

TAACTGATTT TATAGATTCA AATAATTTTC CTTnAGGAT GGAnCTTTA AAACTCCTAA 884 

TATTTATCAA ATGCnATTT AA6TGTCACA CACAGHAAG AAATHGTAC ACCHGTCTC 944 

CTTTAAnCT CATAACAACT CCATAAAATG GGTCCTAGGA TTTCCAITTG AAGATAAGAA 1004 

ACCTGAAGCT TGCCGAAGCC CTGTGTCTGC TCTCCHAAT CTCTGTGAGA GTGCCATCTC 1064 

nCCTGGGGA CTTGTAGGCA TGCCACTGTC TCCTCTTCTG GCTAACAHG CTGHGCTCT 1124 

CTTTTGTGTA TGTGAATGAA TCTTTAAAG ACT GCA GAT AGT GGT GAA GGT GAC 1177 

Thr Ala Asp Ser Gly Glu G1y Asp 
20 25 

TTT CTA GCT GAA GGA G6A 6GC GTG CGT GGC CCA AGG 6TT GTG GAA A6A 1225 
Phe Leu Ala Glu Gly Gly Gly Val Arg Gly Pro Arg Val Val Glu Arg 
30 35 40 

CAT CAA TCT GCC TGC AAA GAT TCA GAC TGG CCC TTC TGC TCT GAT GAA 1273 
His Gin Ser Ala Cys Lys Asp Ser Asp Trp Pro Phe Cys Ser Asp Glu 
45 50 55 

GAC TGG GTAAGCAGTC A6CG6GGGAA GCAGGAGAH CCHCCCTCT GAT6CTAGAG 1329 
Asp Trp 
60 

GGGCTCACAG GCTGACCTGA TT66TCCCAG AAACTTTTTT AAATAGAAAA TAAHGAATA 1389 

GTTACCTACA TAGCAAATAA A6AAAA6GAA CCTACTCCCA AGAGCACTGT nATTTACCT 1449 

CCCCAACTCT GGATCATTAG TGGGTGAACA GACAGGATTT CAGTTGCATG CTCAGGCAAA 1509 

ACCAGGCTCC TGAGTATTGT G6CCTCAATT TCCTGGCACC TATTTATGGC TAAGTGGACC 1569 

CTCATTCCAG AGTTTCTCTG C6ACCTCTAA CTAGTCCTCT TACCTACTTT TAAGCCAACT 1629 

TATaGGAAG AGAAAGGGTA 6GAAGAAATG GGGGCTGCAT G6AAACATGC AAAAHATTC 1689 

TGAATCTGAG AGATAGATCC HACTGTAAT TTTCTCCCn CACTTTCAG AAC TAC 1744 

Asn Tyr 
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AAA T6C CCT TCT 6GC TGC A6G AT6 AAA GG6 HG AH GAT GAA GTC AAT 1792 
Lys Cys Pro Ser Gly Cys Arg Het Lys Gly Leu He Asp Glu Val Asn 
65 70 75 

CAA GAT TTT ACA AAC AGA ATA AAT AAG CTC AAA AAT TCA CTA TTT GAA 1840 
Gin Asp Phe Thr Asn Arg He Asn Lys Leu Lys Asn Ser Leu Phe Glu 
80 85 90 

TAT CAG AAG AAC AAT AAG GAT TCT CAT TCG HG ACC ACT AAT ATA ATG 1888 
Tyr Gin Lys Asn Asn Lys Asp Ser His Ser Leu Thr Thr Asn He Met 
95 100 105 110 



GAA AH HG AGA GGC GAT TTT TCC TCA 6CC AAT A GTAAGTAHA 1932 
Glu lie Leu Arg Gly Asp Phe Ser Ser Ala Asn 





115 




120 








CATATTTACT 


TCTTTGACn 


TATAACAGAA 


ACAACAAAAA 


TCCTAAATAA 


ATATGATATC 


1992 


CGCTTATATC 


TATGACAAH 


TCATCCCAAA 


GTACHAGTG 


TAGAAACACA 


TACCTTCATA 


2052 


ATATCCCTGA 


AAATTTTAAG 


A66GA6CTTT 


TGnrrcGTT 


AIIIIIICAA 


AGTAAAAGAT 


2112 


GTTAACTGAG 


AnGTTTAAG 


GTCACAAAAT 


AAGTCAGAAT 


TTTGGATTAA 


AACAAGAATT 


2172 


TAAATGTGH 


CTTTTCAACA 


6TATATACTG 


AAAGTA6GAT 


GGGTCAGACT 


CTTTGAGTTG 


2232 


ATATTrrTGT 


nCTGCTTTG 


TAAAGGTGAA 


AACTGAGAGG 


TCAAGGAACT 


TCnCAAAGA 


2292 


CACAGAGCTG 


GGAATTCAAC 


TCCCAGACTC 


CACTGAGCTG 


ATTAGGTAGA 


TTTTTAAATT 


2352 


TAAAATATAG 


GGTCAAGCTA 


CGTCAHCTC 


ACAGTCTACT 


CATTAGGGTT 


AGGAAACAH 


2412 


GCAHCACTC 


TGGGCAT6GA 


CAGCGAGTCT 


AGGGAGTCa 


CAGTTTCTCA 


AGTTTTGCn 


2472 


TGCCIIITTA 


CACCTTCACA 


AACACTTGAC 


ATTTAAAATC 


AGTGATGCCA 


ACACTAGCTG 


2532 


GCAAGTGA6T 


GATCCTGTTG 


ACCCAAAACA 


GCHAGGAAC 


CATTTCAAAT 


CTATAGAGTT 


2592 


AAAAAGAAAA 


GCTCATCAGT 


AAGAAAATCC 


AATATGTTCA 


AGTCCCHGA 


HAAGGATGT 


2652 


TATAAAATAA 


HGAAATGCA 


ATCAAACCAA 


CTATTTTAAC 


TCCAAAHAC 


ACCTHAAAA 


2712 


nCCAAAGAA 


AGTTCnCTT 


CTATATTTCT 


HGGGAHAC 


TAAHGCTAT 


TAGGACATCT 


2772 


TAACTGGCAT 


TCATGGAAGG 


CTGCAGGGCA 


TAACAHATC 


CAAAAGTCAA 


ATGCCCCATA 


2832 
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GGTTTTGAAC TCACAGATTA AACT6TAACC AAAATAAAAT TASGCATAH TACAAGCTAG 2892 
TTTCnTCn TCIIIIIICT CTTTCTTTCT ncnTCTTT CTTTCTrTCT TTCTTTCTrT 2952 

cnrcTTTCT ncTccncc nccTrrcn ccTncTm ttgctggcaa ttacagacaa 3012 

ATCACTCAGC AGCTACHCA ATAACCATAT TTTC6ATTTC AG AC CGT GAT AAT 3065 

Asn Arg Asp Asn 
125 

ACC TAC AAC CGA GTG TCA GAG GAT CTG AGA AGC AGA ATT GAA GTC CTG 3113 
Thr Tyr Asn Arg Val Ser Glu Asp Leu Arg Ser Arg He Glu Val Leu 
130 135 140 

AAG CGC AAA GTC ATA GAA AAA GTA CAG CAT ATC CAG CTT CTG CAG AAA 3161 
Lys Arg Lys Val He Glu Lys Val Gin His He Gin Leu Leu Gin Lys 
145 150 155 



AAT GTT AGA GCT CAG HG GH GAT ATG AAA CGA CTS GAG GTAAGTATGT 3210 
Asn Val Arg Ala Gin Leu Val Asp Met Lys Arg Leu Glu 
160 165 170 



GGCTGTGGTC 


CCGAGTGTCC TTGmTTGA GTAGAGG6AA 


AAGGAAGGC6 


ATAGTTATGC 


3270 


ACT6AGTGTC 


TACTATATGC AGAGAAAAGT GTTATATCCA 


TCATCTACCT 


AAAAGTAGGT 


3330 


AnATTTTCC 


TCACTCCACA GHGAAGAAA AAAAAAHCA 


GAGATAHAA 


GTAAATTTTC 


3390 


CAACGTACAT 


AGATAGTAAT TCAAAGCAAT 6TTCAGTCCC 


TGTCTATTCC 


AAGCCATTAC 


3450 


ATCACCACAC 


CTCTGAGCCC TCAGCCTGAG TTCACCAAGG 


ATCATTTAAT 


TAGCGTrrCC 


3510 


THGAGAGGG 


AATAGCACCT TACTCTTGAT CCAHCTGAG 


GCTAAGATGA 


AHAAACAGC 


3570 


ATCCAHGCT 


TATCCTGGCT AGCCCTGCAA TACCCAACAT 


CTCTTCCACT 


GAGGGTGCTC 


3630 


GATAGGCAGA 


AAACAGAGAA TATTAA6TGG TAG6TCTCCG 


AGTCAAAAAA 


AATGAAACCA 


3690 


GTHCCAGAA 


GGAAAAHAA CTACCAGGAA CTCAATAGAC 


GTAGTTTATG 


TATTTGTATC 


3750 


TACAnrrcT 


CTTTAI MM CTCCCCTCTC TCTAG GTG GAC ATT GAT 

Val Asp He Asp 


AH AAG 
He Lys 
175 


3803 
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ATC CGA TCT T6T C6A 6GG TCA TGC AGT AGG GCT HA GCT CGT GAA GTA 3851 
He Arg Ser Cys Arg Gly Ser Cys Ser Arg Ala Leu Ala Arg Glu Val 
180 185 190 

GAT CTG AAG 6AC TAT GAA GAT CAG CAG AAG CAA CTT GAA CAG GTC AH 3899 
Asp Leu Lys Asp Tyr Glu Asp Gin Gin Lys Gin Leu G1u Gin Val He 
195 200 205 

GCC AAA GAC HA CTT CCC TCT AGA GAT AGG CAA CAC HA CCA CTG ATA 3947 
Ala Lys Asp Leu Leu Pro Ser Arg Asp Arg Gin His Leu Pro Leu He 
210 215 220 

AAA ATG AAA CCA GH CCA GAC TTG GH CCC GGA AAT TTT AAG AGC CAG 3995 
Lys Met Lys Pro Val Pro Asp Leu Val Pro Gly Asn Phe Lys Ser Gin 
225 230 235 240 

Cn CAG AAG GTA CCC CCA GAG TGG AAG GCA HA ACA GAC ATG CCG CAG 4043 
Leu Gin Lys Val Pro Pro Glu Trp Lys Ala Leu Thr Asp Met Pro Gin 
245 250 255 

ATG AGA ATG GAG HA GAG AGA CCT GGT GGA AAT GAG ATT ACT CGA GGA 4091 
Met Arg Met Glu Leu Glu Arg Pro Gly Gly Asn Glu He Thr Arg Gly 
260 265 270 

GGC TCC ACC TCT TAT GGA ACC GGA TCA GAG AC6 GAA AGC CCC AGG AAC 4139 
Gly Ser Thr Ser Tyr Gly Thr Gly Ser Glu Thr Glu Ser Pro Arg Asn 
275 280 285 

CCT AGC AGT GCT GGA AGC TGG AAC TCT 6GG AGC TCT GGA CCT GGA AGT 4187 
Pro Ser Ser Ala Gly Ser Trp Asn Ser Gly Ser Ser Gly Pro Gly Ser 
290 295 300 

ACT GGA AAC CGA AAC CCT GG6 AGC TCT 6GG ACT GGA GGG ACT GCA ACC 4235 
Thr Gly Asn Arg Asn Pro Gly Ser Ser Gly Thr Gly Gly Thr Ala Thr 
305 310 315 320 

TGG AAA CCT GGG AGC TCT GGA CCT GGA AGT GCT GGA AGC TGG AAC TCT 4283 
Trp Lys Pro Gly Ser Ser Gly Pro Gly Ser Ala Gly Ser Trp Asn Ser 
325 330 335 



GGG AGC TCT GGA Ml GGA AGT ACT GGA AAC CAA AAC CCT GGG AGC CCT 4331 
Gly Ser Ser Gly Thr Gly Ser Thr Gly Asn Gin Asn Pro Gly Ser Pro 
340 345 350 

AGA CCT GGT AGT ACC GGA ACC TGG AAT CCT GGC AGC TCT GAA CGC GGA 4379 
Arg Pro Gly Ser Thr Gly Thr Trp Asn Pro Gly Ser Ser Glu Arg Gly 
355 360 365 
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AST 6CT GGG CAC T6G ACC TCT GAG AGC TCT GTA TCT GGT AGT ACT GGA 4427 
Ser Ala Gly His Trp Thr Ser Glu Ser Ser Val Ser 61y Ser Thr Gly 
370 375 380 

CAA T6G CAC TCT 6AA TCT GGA AGT TTT AGG CCA GAT AGC CCA GGC TCT 4475 
Gin Trp His Ser Glu Ser Gly Ser Phe Arg Pro Asp Ser Pro Gly Ser 
385 390 395 400 

GGG AAC GC6 AGG CCT AAC AAC CCA GAC TG6 GGC ACA TH GAA GAG GTG 4523 
Gly Asn Ala Arg Pro Asn Asn Pro Asp Trp Gly Thr Phe Glu Glu Val 
405 410 415 

TCA GGA AAT GTA AGT CCA GGG ACA AGG AGA GAG TAC CAC ACA GAA AAA 4571 
Ser Gly Asn Val Ser Pro Gly Thr Arg Arg Glu Tyr His Thr Glu Lys 
420 425 430 

CTG GTC ACT TCT AAA GGA GAT AAA GAG CTC AGG ACT GGT AAA GAG AAG 4619 
Leu Val Thr Ser Lys Gly Asp Lys Glu Leu Arg Thr Gly Lys Glu Lys 
435 440 445 

GTC ACC TCT GGT AGC ACA ACC ACC ACG CGT CGT TCA TGC TCT AAA ACC 4667 
Val Thr Ser Gly Ser Thr Thr Thr Thr Arg Arg Ser Cys Ser Lys Thr 
450 455 460 

GTT ACT AAG ACT GH ATT GGT CCT GAT GGT CAC AAA GAA GH ACC AAA 4715 
Val Thr Lys Thr Val He Gly Pro Asp Gly His Lys Glu Val Thr Lys 
465 470 475 480 

GAA GTG GTG ACC TCC GAA GAT GGT TCT GAC T6T CCC GAG GCA ATG GAT 4763 
Glu Val Val Thr Ser Glu Asp Gly Ser Asp Cys Pro Glu Ala Met Asp 
485 490 495 

TTA GGC ACA HG TCT GGC ATA GGT ACT CTG GAT GGG TTC CGC CAT AGG 4811 
Leu Gly Thr Leu Ser Gly He Gly Thr Leu Asp Gly Phe Arg His Arg 
500 505 510 

CAC CCT GAT GAA GCT GCC TTC HC GAC ACT GCC TCA ACT GGA AAA ACA 4859 
His Pro Asp Glu Ala Ala Phe Phe Asp Thr Ala Ser Thr Gly Lys Thr 
515 520 525 

TTC CCA GGT TTC HC TCA CCT ATG TTA GGA GAG TTT GTC AGT GAG ACT 4907 
Phe Pro Gly Phe Phe Ser Pro Met Leu Gly Glu Phe Val Ser Glu Thr 
530 535 540 
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GA6 TCT AG6 6GC TCA GAA TCT 6GC ATC HC ACA AAT ACA AAG GAA TCC 4955 
61 u Ser Arg Gly Ser Glu Ser Gly He Phe Thr Asn Thr Lys Glu Ser 
545 550 555 560 

AGT TCT CAT CAC CCT G6G ATA 6CT GAA TTC CCT TCC CGT GGT AAA TCT 5003 
Ser Ser His His Pro Gly He Ala Glu Phe Pro Ser Arg Gly Lys Ser 
565 570 575 

TCA AGT TAC AGC AAA CAA TTT ACT AGT AGC ACG AGT TAC AAC AGA GGA 5051 
Ser Ser Tyr Ser iys Gin Phe Thr Ser Ser Thr Ser Tyr Asn Arg Gly 
580 585 590 

GAC TCC ACA TH GAA AGC AAG AGC TAT AAA ATG GCA GAT GAG GCC GGA 5099 
Asp Ser Thr Phe Glu Ser Lys Ser Tyr Lys Met Ala Asp Glu Ala Gly 
595 600 605 

AGT GAA GCC GAT CAT GAA GGA ACA CAT AGC ACC AAG AGA GGC CAT GCT 5147 
Ser Glu Ala Asp His Glu Gly Thr His Ser Thr Lys Arg Gly His Ala 
610 615 620 

AAA TCT CGC CCT GTC AGA GGT ATC CAC ACT TCT CCT HG GGG AAG CCT 5195 
Lys Ser Arg Pro Val Arg Gly He His Thr Ser Pro Leu Gly Lys Pro 
625 630 635 640 

TCC CTG TCC CCC TAGACTAAGT TAAATATTTC TGCACAGTGT TCCCATGGCC 5247 
Ser Leu Ser Pro 

645 



CCTTGCATTT 


CCTTCHAAC 


TCTCTGTTAC 


ACGTCAHGA 


AACTACACn 


TTTTGGTCTG 


5307 


TTTTTGTGCT 


AGACTGTAAG 


nCCHGGGG 


GCAGGGCCn 


TGTCTGTCTC 


ATCTCTGTAT 


5367 


TCCCAAAT6C 


CTAACAGTAC 


AGAGCCATGA 


CTCAATAAAT 


ACATGTTAAA 


TGGATGAATG 


5427 


AAHCCTCTG 


AAACTCTAH 


TGAGCTTATT 


TAGTCAAATT 


CTTTCACTAT 


TCAAAGTGTG 


5487 


TGCTATTAGA 


ATTGTCACCC 


AACTGATTAA 


TCACAIIill 


AGTATGTGTC 


TCAGTTGACA 


5547 


TTTAGGTCAG 


GCTAAATACA 


AGnGTGTTA 


GTATTAAGTG 


AGCTTAGCTA 


CCTGTACTGG 


5607 


TTACTTGCTA 


nAGTTTGTG 


CAAGTAAAAT 


TCCAAATACA 


TTTGAGGAAA 


ATCCCCTTTG 


5667 


CAATTTGTA6 


GTATAAATAA 


CCGCnATTT 


GCATAAGHC 


TATCCCACTG 


TAAGTGCATC 


5727 


CTTTCCaAT 


GGAGGGAAGG 


AAAGGAGGAA 


GAAAGAAA6G 


AAGG6AAA6A 


AACAGTATTT 


5787 


GCCHATnA 


ATCTGAGCCG 


TGCCTATCn 


TGTAAAGTTA 


AATGAGAATA 


AcncnccA 


5847 
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ACCAGCHAA llllllllll AGACT6TGAT SATGTCCTCC AAACACATCC TTCAGGTACC 5907 
CAAAGTGGCA TTnCAATAT CAAGCTATCC GGATCC 5943 



(2) INFORMATION FOR SEQ ID N0:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 644 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: protein 

(x1) SEQUENCE DESCRIPTION: SEQ ID N0:2: 

Met Phe Ser Met Arg He Val Cys Leu Val Leu Ser Val Val Gly Thr 
15 10 15 

Ala Trp Thr Ala Asp Ser Gly 61 u Gly Asp Phe Leu Ala Glu Gly Gly 
20 25 30 

Gly Val Arg Gly Pro Arg Val Val Glu Arg His Gin Ser Ala Cys Lys 
35 40 45 

Asp Ser Asp Trp Pro Phe Cys Ser Asp Glu Asp Trp Asn Tyr Lys Cys 
50 55 60 

Pro Ser Gly Cys Arg Met Lys Gly Leu He Asp Glu Val Asn Gin Asp 
65 70 75 80 

Phe Thr Asn Arg He Asn Lys Leu Lys Asn Ser Leu Phe Glu Tyr Gin 
85 90 95 

Lys Asn Asn Lys Asp Ser His Ser Leu Thr Thr Asn He Met Glu He 
100 105 no 

Leu Arg Gly Asp Phe Ser Ser Ala Asn Asn Arg Asp Asn Thr Tyr Asn 
115 120 125 

Arg Val Ser Glu Asp Leu Arg Ser Arg He Glu Val Leu Lys Arg Lys 
130 135 140 

Val He Glu Lys Val Gin His He Gin Leu Leu Gin Lys Asn Val Arg 
1*5 150 155 160 
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Ala Gin Leu Val Asp Met Lys Arg Leu 61 u Val Asp He Asp lie Lys 
165 170 175 

He Arg Ser Cys Arg 61y Ser Cys Ser Arg Ala Leu Ala Arg 61 u Val 
180 185 190 

Asp Leu Lys Asp Tyr 61 u Asp 61 n 61 n Lys 61 n Leu 61 u 61 n Val lie 
195 200 205 

Ala Lys Asp Leu Leu Pro Ser Arg Asp Arg 61 n His Leu Pro Leu He 
210 215 220 

Lys Met Lys Pro Val Pro Asp Leu Val Pro 61y Asn Phe Lys Ser 61 n 
225 230 235 240 

Leu Gin Lys Val Pro Pro Glu Trp Lys Ala Leu Thr Asp Met Pro 61 n 
245 250 255 

Met Arg Met Glu Leu 61 u Arg Pro 61y 61y Asn 61 u He Thr Arg Gly 
260 265 270 

61y Ser Thr Ser Tyr 61y Thr 61y Ser 61 u Thr 61 u Ser Pro Arg Asn 
275 280 285 

Pro Ser Ser Ala 61y Ser Trp Asn Ser 61y Ser Ser 61y Pro 61y Ser 
290 295 300 

Thr 61y Asn Arg Asn Pro 61y Ser Ser 61y Thr 61y 61y Thr Ala Thr 
305 310 315 320 

Trp Lys Pro Gly Ser Ser Gly Pro Gly Ser Ala Gly Ser Trp Asn Ser 
325 330 335 

Gly Ser Ser Gly Thr Gly Ser Thr Gly Asn Gin Asn Pro Gly Ser Pro 
340 345 350 

Arg Pro 61y Ser Thr 61y Thr Trp Asn Pro 61y Ser Ser 61u Arg 61y 
355 360 365 

Ser Ala 61y His Trp Thr Ser 61 u Ser Ser Val Ser 61y Ser Thr Gly 
370 375 380 

61 n Trp His Ser 6lu Ser Gly Ser Phe Arg Pro Asp Ser Pro Gly Ser 
385 390 395 400 

Gly Asn Ala Arg Pro Asri Asn Pro Asp Trp Gly Thr Phe Glu Glu Val 
405 410 415 
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Ser Gly Asn Val Ser Pro Sly Thr Arg Arg 61 u Tyr His Thr 61 u Lys 

420 425 430 

Leu Val Thr Ser lys 61y Asp Lys 61 u Leu Arg Thr 61y Lys 61 u Lys 
435 440 445 

Val Thr Ser 61y Ser Thr Thr Thr Thr Arg Arg Ser Cys Ser Lys Thr 
450 455 460 

Val Thr lys Thr Val He Gly Pro Asp Gly His lys 61u Val Thr Lys 
465 470 475 480 

61 u Va1 Val Thr Ser 61u Asp 61y Ser Asp Cys Pro 61 u Ala Net Asp 
485 490 495 

Leu 61y Thr Leu Ser 61y He 61y Thr Leu Asp Gly Phe Arg His Arg 
500 505 510 

His Pro Asp 61 u Ala Ala Phe Phe Asp Thr Ala Ser Thr 61y Lys Thr 
515 520 525 

Phe Pro 61y Phe Phe Ser Pro Met Leu 61y 61 u Phe Val Ser 61 u Thr 
530 535 540 

61u Ser Arg Gly Ser Glu Ser Gly He Phe Thr Asn Thr Lys 61u Ser 
545 550 555 560 

Ser Ser His His Pro Gly He Ala Glu Phe Pro Ser Arg Gly Lys Ser 
565 570 575 

Ser Ser Tyr Ser Lys Gin Phe Thr Ser Ser Thr Ser Tyr Asn Arg 61y 
580 585 590 

Asp Ser Thr Phe 61 u Ser Lys Ser Tyr Lys Met Ala Asp 61 u Ala 61y 
595 600 605 

Ser 61u Ala Asp His Glu Gly Thr His Ser Thr Lys Arg Gly His Ala 
610 615 620 

Lys Ser Arg Pro Val Arg Gly He His Thr Ser Pro Leu 61y Lys Pro 
625 630 635 640 



Ser Leu 



Ser Pro 
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(2) INFORMATION FOR SEQ ID N0:3: 

(i ) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8878 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLO&Y: linear 

(11) MOLECULE TYPE: DNA (genomic) 



(v11) IMMEDIATE SOURCE: 

(B) CLONE: human fibrinogen B-beta chain 

(1x) FEATURE: 

(A) NAME/KEY: in1sc_RNA 

(B) LOCATION: I.. 469 

(1x) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 470.. 583 

(1x) FEATURE: 

(A) NAME/KEY: Intron 

(B) LOCATION: 584.. 3257 

(1x) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 3258.. 3449 

(1x) FEATURE: 

(A) NAM^/KEY: Intron 

(B) LOCATION: 3450. .3938 

(1x) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 3939.. 4122 

(1x) FEATURE: 

(A) NAME/KEY: Intron 

(B) LOCATION: 4123.. 5042 

(1x) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 5043.. 5270 
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(1x) FEATURE: 

(A) NAHE/KEY: Intron 

(B) LOCATION: 5271.. 5830 

(1x) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 5831.. 5944 

(1x) FEATURE: 

(A) NAME/KEY: intron 

(B) LOCATION: 5945.. 6632 

(1x) FEATURE: 

(A) NANE/KEY: exon 

(B) LOCATION: 6633.. 6758 

(1x) FEATURE: 

(A) NANE/KEY: Intron 

(B) LOCATION: 6759.. 6966 

(1x) FEATURE: 

(A) NAN^EY: exon 

(B) LOCATION: 6967.. 7252 

(ix) FEATURE: 

(A) NAME/KEY: Intron 

(B) LOCATION: 7253.. 7870 

(Ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 7871.. 8102 

(ix) FEATURE: 

(A) NAME/KEY: 3'UTR 

(B) LOCATION: 8103.. 8537 

(ix) FEATURE: 

(A) NAME/KEY: nisc.RNA 

(B) LOCATION: 8538. .8878 

(ix) FEATURE: 

(A) NANE/KEY: CDS 

(B) LOCATION: join (470.. 583, 3258. .3449, 3939.. 4122, 5043.. 5270, 

5831.. 5944, 6633.. 6758, 6967.. 7252, 7871.. 8102) 



(Xi) 



SEQUENCE DESCRIPTION: SEQ ID N0:3: 
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GAAHCATGC CCCTrTTGAA ATASACTTAT GTCAHGTCA GAAAACATAA SCATrTATGG 60 

TATATCATTA ATGAGTCACG ATTTTAGTGG HGCCnCTG AGTAGGTCAA ATTTACTAAG 120 

CnAGATTTG TTTTCTCACA TAnCTTTCG GAGCHGIGT AGTTTCCACA nAATTTACC 180 

A6AAACAAGA TACACACTCT CTTTGAGGAG TGCCCTAACT TCCCATCAH HGTCCAAn 240 

AAATGAATTG AAGAAATTTA ATGITTCTAA ACTAGACCAA CAAAGAATAA TAGHGTATG 300 

ACAAGTAAAT AAGCITTGCT GGGAAGATGT TGCTTAAATG ATAAAATGGT TCAGCCAACA 360 

AGTGAACCAA AAATTAAATA HAACTAAGG AAAGGTAACC ATTTCTGAAG TCAHCCTAG 420 

CAGA6GACTC AGATATATAT AGGAHGAAG ATCTCTCAGT TAAGTCTAC ATG AAA 475 

Net Lys 
1 

AGG ATG GH TCT TGG AGC TTC CAC AAA CTT AAA ACC ATG AAA CAT CTA 523 
Arg Het Val Ser Trp Ser Phe His Lys Leu Lys Thr Het Lys His Leu 
5 10 15 

HA TTG CTA CTA TTG TGT GTT TTT CTA GH AAG TCC CAA GGT GTC AAC 571 
Leu Leu Leu Leu Leu Cys Val Phe Leu Val Lys Ser Gin Gly Val Asn 
20 25 30 

GAC AAT GAG GAG GTGAATTTTT TAAAGCATTA TTATATTATT AGTAGTATTA 623 
Asp Asn Glu Glu 
35 

TTAATATAAG ATGTAACATA ATCATATTAT GTGCTTATTT TAATGAAATT AGCATTGCH 683 

ATAGHATGA AATGGAATTG HAACCTCTG ACHAHGTA TTTAAAGAAT GTTTCATAGT 743 

ATTTCTTATA TAAAAACAAA GTAATTTCn GTTTTCTAGT TTATCACCH TGTTTTCnA 803 

AGATGAGGAT 6GCTTAGCTA ATGTAAGATG TGTTTTTCTC ACHGCTAH CTGAGTACTG 863 

TGATTHCAT HACTTCTAG CAATACAGGA HACAATTAA GAGGACAAGA TCT6AAAATC 923 

TCACAAACTA TAAAATAATA AAAGAGCAGA ATTHAAGAT AAAAGAAACT GGTGGTA6GT 983 

AGAHGHCT TTGGTGAAGG AAG6TAATAT ATAHGTTAC TGAGATTACT ATHATAAAA 1043 

AHATAACTA AGCaAAAAG CAAAATACAT CAAGTGTAAT GATAGAAAAT GAAATAHGC ll03 
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TTTTTTCAeA TGAAAAGTTC AAATTAGA6T TAGTGT6TAT TGHAHAn AATAGTTATG 1163 

AAACACGGTT CAGTCTAATT TAfnATTTG TAGAACAGH TGTCCTCAAC TATTATTTTT 1223 

GCTGACHAT TGCTGTTAAT TTGCAGTTAC TAAAAATACA GAAATGCATT TAGGACAATG 1283 

GATATTTAAG AAATTTAAAT niATCATCA AACGTATCAT GGCCAAATTT CHACATATA 1343 

GCATAGTATC AHAAACTAG AAATAAGAAT ACACAATAAT ATTTAAATGA AGTGATTCAT 1403 

TTCGGATCAT TAnGAGTn CAAGGGAACT TGAGTGTTGT ACTTATCAGA CTCTACATGT 1463 

AAGAACATAT AGHAATCTG GTTGT6TGTG TAAAAACATA TGGTTAATCT GGTTAAGTCT 1523 

GGHAATCAT ATTAGGTAAG AAAAATGTAA AGAATGTGTA AGACGAAATT nTGTAAAGT 1583 

ACTCTGCAAA GCACTTTCAC ATTTCTGCn ATCAACTAAA CCTCACAGA6 ATAGTTTAAT 1643 

AGTHAGGCT HAAAATGGA TTTTGATTAT TCAACAAGTG GCCTTCATAA TTTCTTTAAG 1703 

TGrrmcn TAAGTATATA CmCTTTAA ATAIIIMIA AAATTTCCn nCTCTAGTA 1763 

AAGCCAGACC ATCCATGCTA CCTCTCTAGT GGCACTCTGA AATAAAAAGA AAATAGTTTr 1823 

CTCTGHATA ATTGTATTTG TAATAAGCAG ATGAATCACA ITTCTTAAAA TTTGTTrTAG 1883 

AGAGGGTAAG CTCTGACTAG 6ACCATGACT TCAATGTGAA ATATGTATAT ATCCTCCGAA 1943 

TCTTTACATA HAAGAATGT ATATAGTCAA CTGGHAAAC AGGAAAATCT GGAACAGCCT 2003 

GGCTGGGTTT TAATCHAGC ACCATCCTAC TAAATGHAA ATAATATTAT AATCTAATGA 2063 

ATAAATGACA ATGCAATTCC AAATAGAGH CATCTGATGA CHCTAGACT CACAAAAHG 2123 

CAAGAGAGCT CAGTTGnGC TCAGTTGTTC CAAATCATGT CGTTTGTTAA TTTGTAATTA 2183 

AGaCCAAAG GATGTATAGC TACTGACAAA AAAAAAAATG AGAATGTAGT TAATCCAAAT 2243 

CAAAACTHC CTAHGCAAT GCGTATTTTC TGCnCAHA TCCTTTAATA TAATATTTrA 2303 

AGHAGCAAG TAATTTTAAT TACAATGCAC AAGCCHGAG AATTATnTA AATATAAGAA 2363 

AATCATAATG TTTGATAAAG AAATCATGTA AGAAATTTCA AGATAATGGT nAACAAATA 2423 

ATTTTGnGA TAGAAGATAA GACTAAAAGT GAAAHCGAA GTGGAGAGGA CACHAAACT 2483 

GTAGTACHG HATGIGIGA TTCCAGTAAA AATAGTAATG AGCACTTAH AHGCCAAGT 2543 
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ACTGHCTGA GGGTACCATA TGCAATAAGT TATrTAATCC TTACAATAAT CTTGTAAGGC 2603 

AGAHCAAAC TATCAHACA CnATTTTAC AGATGAGAAA ACTGGG6CAC AGATAAAGCA 2663 

ACHGCCCAA GGTCTCATAG CTGTAAGTCA ACCCTAC66T CAAGACCTAC AASTAGCCGA 2723 

GCTCCAGAGT ACATTATGAG 6GTCAAAGAT TGTCnAHA CAAATAAAH CCAAGTAGAA 2783 

TCAACCTTTA ATAAGTCm AATGTCTCn AAATATGTTT ATATAG6AGT CTAATCACCA 2843 

AHCACAAAA ATGAAAGTAG G6AAATGATT AACAATAATC ATAG6AATCT AACAATCCAA 2903 

GTGGCHGAG AATATTCAn CnCHGACA GTATAGATTC TTTACAATn CGTAAGHCC 2963 

AATGTATGn HAGGAATAT GAGGTCAHA CTAHCATAA TCTGATACAG CTTTATCCTA 3023 

AGSCCTCTCT TTAAAAACTA CACTGCATCA TAGCTTTTTT GT6CAGTTGG TCTTTCTACT 3083 

GHACTGAAC AGTAAGCAAC CTACAGAHC ACTATCACCA ACCAGCCAGT TGATGGATCT 3143 

TAAGCAAAH ATCAAGCTTG TGATAACCTA AAHATAAAA TGAGGGTGH GGAATAGHA 3203 

CAHCCAAAT CTTCTATAAC ACTCTGTATT ATATTTCTGC CTCATTCCn GTAG 6GT 3260 

Gly 

nC TTC AGT GCC C6T GGT CAT CGA CCC CTT GAC AAG AA6 AGA GAA GAG 3308 
Phe Phe Ser Ala Arg Gly His Arg Pro Leu Asp Lys Lys Arg Glu Glu 
*0 45 50 55 

GCT CCC A6C CTG AGG CCT GCC CCA CC6 CCC ATC AGT GGA GGT 6GC TAT 3356 
Ala Pro Ser Leu Arg Pro Ala Pro Pro Pro He Ser Gly Gly Gly Tyr 
60 65 70 

CGG GCT CGT CCA GCC AAA GCA GCT GCC ACT CAA AAG AAA GTA GAA AGA 3404 
Arg Ala Arg Pro Ala Lys Ala Ala Ala Thr Gin Lys Lys Val Glu Arg 
75 80 85 

AAA GCC CCT GAT GCT GGA GGC TGT CU CAC GCT GAC CCA GAC CTG 3449 
Lys Ala Pro Asp Ala Gly Gly Cys Leu His Ala Asp Pro Asp Leu 
90 95 100 

GTG6GTGCAC TGATGTTTCT TGCAGTGGTG GCTCTCTCAT GCAGAGAAAG CCTGTAGTCA 3509 

TGGCAGTCTG CTAATGTTTC ACTGACCCAC AHACCATCA CTGTTATTTT GTTTGTHAT 3569 
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TTTGSAAATA AAAHCAAAA CATAAACATA nGGGCCTTT GGTTTAGGCT TTCTTTCTTG 3629 

TTrrCTTTGG TCTGGGCCCA AAATTTCAAA HAGGATATG TGGGTGCCAC CTTTCCATrT 3689 

GTATTTTGCC ACTGCCTTTG TTTAGTrGGT AAAATTTTCA TAGCCCAAH AT AII i m C 3749 

TGGGGTAAGT AATATTTTAA ATCTCTATGA 6AGTATGATG AT6ACTTTCG AATTTCTGGT 3809 

CTTACAGAAA ACCAAATAAT AAATTTTTAT GHGGCTAAT C6TATCGCTG AATTTTCCTA 3869 

TGTGCTATrr TAAlAAATGT CCAT6ACCCA AATCCHCAT CTAATGCCTG CTATTTTCn 3929 

TGTTTTTAG 6GG GTG HG TGT CCT ACA GGA TGT CAG TT6 CAA GAG 6CT 3977 
Gly Val Leu Cys Pro Thr Gly Cys Gin Leu Gin Glu Ala 
105 110 115 

HG CTA CAA CAG GAA A6G CCA ATC AGA AAT AGT GH GAT GAG HA AAT 4025 
Leu Leu Gin Gin Glu Arg Pro He Arg Asa Ser Val Asp Glu Leu Asn 
120 125 130 

AAC AAT GTG GAA GCT GTT TCC CAG ACC TCC TCT TCT TCC TTT CAG TAC 4073 
Asn Asn Val Glu Ala Val Ser Gin Thr Ser Ser Ser Ser Phe Gin Tyr 
135 140 145 

ATG TAT TTG CTG AAA GAC CTG T6G CAA AAG AGG CAG AAG CAA GTA AAA G 4122 
Met Tyr Leu Leu Lys Asp Leu Trp Gin Lys Arg Gin Lys Gin Val Lys 
150 155 160 

GTAGATATCC HGTGCTTTC CATTCGATTT TCAGCTATAA AATTGGAACC tHAGACTGC 4182 

CACGAGAATG CATGGTTGTG AGAAGATTAA CATHCTGGG TTAGTGAATA GCATTCATAC 4242 

GCrrnGGGC ACCTTCCCCT GCAACTTGCC AGATAAGCAC TAHCAGCTC HATTCCCAG 4302 

TCTGACATCA 6CAAGTGTGA TTnCTATGA AAAATTCTAC TATGACTCCT TATTTTAAGT 4362 

ATACAAGAAA CHGTGACTC AGAAGATAAT ATTTACAGAG TGGAAAAAAA CCCCTAGCAT 4422 

nATAGTTTT AACATHGAG GTTTTGAATG AGAGAGHAT CCATAATATA HCAAHGTG 4482 

HGTGGATAA TGACACCTAA CCTGTGAATC HGAGGTCAG AATGHGAGT GCTGHGACT 4542 

TGGTGGTCAG GAAACAGCTA GT6CGTGAGC CTGGCACAGG CATCTCAGTG AGTAGCATAC 4602 

CCACAGHGG AAATTTTTCA AAGAAATCAA AGGAATCATG ACATCHATA AATTTCAAGG 4662 

nCTGCTATA CHATGTGAA ATGGATAAAT AAATCAAGCA TATCCACTCT GTAAGAHGA 4722 
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ACTTCTCA6A TGGAA6ACCC CAATACT6CT TTCTCCTCn TTCCCTCACC AAAGAAATAA 4782 

ACAACCTATT TCATTTATTA CTGGACACAA TCTTTAGCGT ATACCTATGG TAAAHACTA 4842 

GTATGGTGGT TAGGATTrAT GnAATTTGT ATAT6TCATG CGCCAAATCA TTTCCACTAA 4902 

ATAT6ACTAT ATATCATAAC TGCHGGTGA TAGCTCAGTG TTTAATAGn TAHCTCAGA 4962 

AAATCAAAAT TGTATAGTTA AATACAHAG TTnATGAGG CAAAAAT6CT AACTATHCT 5022 

ACATAATTTC ATTmCCAG AT AAT 6AA AAT GTA GTC AAT SAG TAC TCC 5071 

Asp Asn Glu Asn Val Val Asn Glu Tyr Ser 
165 170 



TCA 


GAA 


CTG 


GAA 


AAG 


CAC 


CAA 


TTA 


TAT 


ATA GAT GAG 


ACT 


GTG 


AAT 


AGC 


5119 


Ser 


Glu 


Leu 


Glu 


Lys 


His 


Gin 


Leu 


Tyr 


He Asp Glu 


Thr 


Val 


Asn 


Ser 




175 










180 






185 












AAT 


ATC 


CCA 


ACT 


AAC 


CTT 


CGT 


GTG 


CTT 


CGT TCA ATC 


CTG 


GAA 


AAC 


CTG 


5167 


Asn 


He 


Pro 


Thr 


Asn 


Leu 


Arg 


Val 


Leu 


Arg Ser He 


Leu 


Glu 


Asn 


Leu 


190 










195 








200 








205 




AGA 


AGC 


AAA 


ATA 


CAA 


AAG 


HA 


GAA 


TCT 


GAT GTC TCA 


GCT 


CAA 


ATG 


GAA 


5215 


Arg 


Ser 


Lys 


He 


Gin 


Lys 


Leu 


Glu 


Ser 


Asp Val Ser 


Ala 


Gin 


Net 


Glu 












210 










215 






220 






TAT 


TGT 


CGC 


ACC 


CCA 


TGC 


ACT 


GTC 


AGT 


TGC AAT ATT 


CCT 


GTG 


GTG 


TCT 


5263 


Tyr 


Cys 


Arg 


Thr 


Pro 


Cys 


Thr 


Val 


Ser 


Cys Asn He 


Pro 


Val 


Val 


Ser 








225 










230 






235 









GGC AAA G 
Gly Lys 


GTAACTGATT 


CATAAACATA 


MM lAGAGA GHCCAGAAG 


AACTCACACA 


5320 


CCAAAAATAA 


GAGAACAACA 


ACAACAACAA 


AAATGCTAAG TGGATTTTCC 


CAACAGATCA 


5380 


TAATGACAH 


ACAGTACATC 


ATAAAAATAT 


CCTTAGCCAG TTGTGTmG 


GACTGGCCTG 


5440 


GTGCATTTGC 


TGGTTHGAT 


GAGCAGGATG 


GG6CACAGGT AGTCCCAGGG 


GTG6CTGATG 


5500 


T6TGCATCTG 


CGTACTGCa 


TGAACAGATG 


GCAGAACCAC AGATAGATGT 


AGAAGTTTCT 


5560 


CCATTTTGTG 


TGHCTGGGA 


GCTCATGGAT 


ATTCCAGGAC ACAAAAGGTG 


GAGAAGAGCT 


5620 


HGHCATCC 


TCHAGCAGA 


TAAACGTCCT 


CAAAACTGGG TTCGACHAC 


TAAAGTAAAA 


5680 
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TGAAAATCTA ATATTTGTTA TAnATTTTC AAAG6TCTAT AATAACACAC TCCTTAGTAA 5740 

CTTATGTAAT GTTATTTTAA AGAATTGGTG ACTAAATACA AAGTAATTAT GTCATAAACC 5800 

CCTGAACATA ATGHGTCH ACATTTGCAG AA TGT GAG 6AA AH ATC AGG AAA 5853 

61u Cys 61u Glu lie He Arg Lys 
240 245 

GGA GGT GAA ACA TCT GAA ATG TAT CTC KH CAA CCT GAC AGT TCT GTC 5901 
61y G1> Glu Thr Ser Glu Net Tyr Leu He Gin Pro Asp Ser Ser Val 
250 255 260 

AAA CCG TAT AGA GTA TAC TGT GAC ATG AAT ACA GAA AAT GGA G 5944 
Lys Pro Tyr Arg Val Tyr Cys Asp Net Asn Thr Glu Asn 61 y 
265 270 275 

GTAAGCTTTC GACAGTTGn GACCTGHGA TCTGTAAHA TTTGGATACC GTAAAATGCC 6004 

AGGAAACAAG GCCAGGTGTG GTGGCTCATA CCTGTAAnC CAGCACCHG GGAGGCCAAA 6064 

GTGGGCTGAT AGCTTGAGCC TAGGAGTTTG AAACTAGCCT GGGCAACATA ATGAGACCCT 6124 

AACTCTACAA AAAAAAAAAA AATACCAAAA AAAAAAAAAA AATCAGCTGT GTTGGTAGTA 6184 

TGTGCCTGTA GTCCCAGCTA TCCA6GAGGC TGAGATGGGA GATCACCTGA GCCCACAACC 6244 

TGGAGTCHG ATCAT6CTAC TGAACTGTA6 CCT6GGCAAC AGAGGATAGT GAGATCCTGT 6304 

CTCAAAAAAA AAAAHAATT AAAAAGCCAG GAAACAAGAC TTAGCTCTAA CATCTAACAT 6364 

A6CTGACAAA GGAGTAATTT GATGTGGAAT TCAACCTGAT ATTTAAAAGT TATAAAATAT 6424 

CTATAAHCA CAATTTGGGG TAAGATAAAG CACTT6CAGT HCCAAAGAT TTTACAAGn 6484 

TACCTCTCAT ATTTATTTCC TTAHGTGTC TATTTTAGAG CACCAAATAT ATACTAAAT6 6544 

GAATG6ACAG GGGAHCAGA TATTATHTC AAAGTGACAT TATTTGCTGT TGGTTAATAT 6604 

ATGCTCTTTT TGTTTCTGTC AACCAAAG GA TGG ACA GTG An CA6 AAC CGT 6655 

Gly Trp Thr Val He Gin Asn Arg 
280 285 

CAA GAC GGT AGT GH GAC TTf G6C AGG AAA TGG GAT CCA TAT AAA CAG 6703 
Gin Asp Gly Ser Val Asp Phe Gly Arg Lys Trp Asp Pro Tyr Lys Gin 
290 295 300 
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GGA TTT 66A AAT GTT GCA ACC AAC ACA GAT GGG AAG AAT TAC TGT G6C 6751 

Gly Phe Gly Asn Val Ala Thr Asn Thr Asp Gly Lys Asn Tyr Cys Gly 
305 310 315 

CTA CCA G GTAACGAACA GGCAT6CAAA ATAAAATCAT TCTATTTGAA AIGGGATTH 6808 
Leu Pro 



TTTTAAnAA AAAACATTCA TTGnGGAAG CCTGinTAG GCAGHAAGA GGAGTTTCCT 6868 

GACAAAAATG TGGAAGCTAA AGATAAGG6A AGAAA6GCAG TTTTTAGTn CCCAAAATTT 6928 

TATTTTTGGT GAGAGATTTT ATTTTGTTrT TCTTTTAG GT GAA TAT TGG GTT 6980 

Gly Glu Tyr Trp Leu 
320 

GGA AAT GAT AAA AH AGC CAG CTT ACC AGG ATG GGA CCC ACA GAA CH 7028 
Gly Asn Asp Lys He Ser Gin Leu Thr Arg Met Gly Pro Thr Glu Leu 
325 330 335 340 

HG ATA GAA ATG GAG GAC TGG AAA GGA GAC AAA GTA AAG GCT CAC TAT 7076 
Leu He Glu Ket Glu Asp Trp Lys Gly Asp Lys Val Lys Ala His Tyr 
345 350 355 

GGA GGA TTC ACT GTA CAG AAT GAA GCC AAC AAA TAC CAG ATC TCA 6TG 7124 
Gly Gly Phe Thr Val 61n Asn Glu Ala Asn Lys Tyr Gin He Ser Val 
360 365 370 

AAC AAA TAC AGA GGA ACA GCC 6GT AAT GCC CTC ATG GAT GGA GCA Id 7172 
Asn Lys Tyr Arg Gly Thr Ala Gly Asn Ala Leu Met Asp Gly Ala Ser 
375 380 385 

CAG aG ATG GGA GAA AAC AGG ACC ATG ACC AH CAC AAC GGC ATG TTC 7220 
Gin Leu Met Gly Glu Asn Arg Thr Met Thr He His Asn Gly Met Phe 
390 395 400 



nC AGC ACG TAT GAC AGA GAC AAT GAC GGC 
Phe Ser Thr Tyr Asp Arg Asp Asn Asp Gly 
405 410 



TG GTATGTGT6G 7262 

Trp 

415 



CACTCTHGC TCCTGCTTTA AAAATCACAC TAATATCAH ACTCAGAATC AHAACAATA 7322 
TTTTTAATAG CTACCACTTC CTGGGCACTT ACTGTCAGCC ACT6TCCTAA GCTCTTTATG 7382 
CATCACTCGA AAGCATHCA ACTATAAGGT AGACATTCH ATTCTCATTT TACAGATGAG 7442 



ATTTAGAGA6 AHACGTGAT TTGTCCAATG TCACACAACT ACCCAGAGAT AAAACTAGAA 



7502 
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TTT6AGCACA GTTACTTTCT 6AATAATGAG CATTTAGATA AATACCTATA TCTCTATAH 7562 

CTAAAGT6TG TGTGAAAACT nCATTTrCA TTTCCAGGGT TCTCTGATAC TAAGGGHGT 7622 

AAAAGCTAH AHCCAGTAT AAAGTAACAA ACACAGTCCC TAGATGGAH GCCACAAAGG 7682 

CCCAGHATC TCTCmCTT 6CTATAGGGC ACAGGAGGTC ITTGGTGTAT TAGTGTGACT 7742 

CTAT6TATA6 CACCCAAAG6 AAAGACTACT 6T6CACACGA GT6TAGCAGT CTTTTATGGG 7802 

TAATCT6CAA AACGTAACH GACCACCGTA GHCTGinC TAATAACGCC AAACACATTT 7862 

TCTTTCAG 6 HA ACA TCA GAT CCC AGA AAA CAG TGT TCT AAA GAA GAC 7910 
Leu Thr Ser Asp Pro Arg Lys Gin Cys Ser Lys Glu Asp 
420 425 

GGT GGT GGA TGG T6G TAT AAT AGA TGT CAT GCA GCC AAT CCA AAC 6GC 7958 
61y Gly 61y Trp Trp Tyr Asn Arg Cys His Ala Ala Asn Pro Ash Gly 
430 435 440 

AGA TAC TAC TGG GGT GGA CAG TAC ACC TGG GAC AT6 GCA AAG CAT GGC 8006 
Arg Tyr Tyr Trp Gly Gly Gin Tyr Thr Trp Asp Met Ala Lys His Gly 
445 450 455 460 

ACA GAT GAT GGT GTA GTA TGG ATG AAT TGG AAG GGG TCA TGG TAC TCA 8054 
Thr Asp Asp Gly Val Val Trp Met Asn Trp Lys Gly Ser Trp Tyr Ser 
465 470 475 

ATG AGG AAG ATG AGT ATG AAG ATC AGG CCC HC HC CCA CAG CAA TA6TCCCCAA 
8109 

Met Arg Lys Met Ser Met Lys He Arg Pro Phe Phe Pro Gin Gin 
480 485 490 

TACGTAGATT TTTGaCTTC TGTAT6TGAC AACATTTTTG TACAHATGT TAHGGAAH 8169 

nCTTTCATA CAHATAHC CTCTAAAACT aCAAGCAGA CGTGAGTGTG ACTTTTTGAA 8229 

AAAAGTATAG GATAAAHAC ATTAAAATAG CACATGATTT TCTTTTGTn TCnCATTTC 8289 

TCHGCTCAC CCAAGAAGTA ACAAAAGTAT AGTTTTGACA GAGHGGTGT TCATAATTTC 8349 

AGTTCTAGH 6ATTGCGAGA ATTTTCAAAT AAG6AAGAGG GGTCTTTTAT CCTT6TCGTA 8409 

6GAAAACCAT GAC66AAAG6 AAAAACJGAT GTHAAAAGT CCACTTHAA AACTATATH 8469 

ATTTATGTAG GATCT6TCAA A6AAAACTTC CAAAAAGAH TAHAAHAA ACCAGACTCT 8529 
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GTTGCAATAA GnAATGTTT TCnGTTrTG TAATCCACAC AHCAATGAG TTAGGCTTTG 8589 

CACTTGTAAG GAAGGAGAAG CGnCACAAC CTCAAATAGC TAATAAACCG GTCHGAATA 8649 

TTTGAAGAn TAAAATCTGA CTCTAG6AC6 6GCACGGT6G CTCACGACTA TAATCCCAAC 8709 

ACirrGGGAG GCTGAGGC6G 6CGGTCACAA GGTCAGGAGT TCAAGACCA6 CCTGACCAAT 8769 

AT6GTGAAAC CCCATCTCTA CTAAAAATAC AAAAATTAGC CAGGCGT6GT GGCAGGTGCC 8829 

TGTAGGTCCC AGCTAGCCTG TGAGGTGGAG ATTGCAHGA GCCAA6ATC 8878 

(2) INFORMATION FOR SEQ ID N0:4: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 491 amino acids 

(B) TYPE: amino acid 
(0) TOPOLOGY: linear 

(11) MOLECULE TYPE: protein 

(x1) SEQUENCE DESCRIPTION: SEQ ID N0:4: 

Met Lys Arg Met Val Ser Trp Ser Phe His Lys Leu Lys Thr Met Lys 
1 5 10 15 

His Leu Leu Leu Leu Leu Leu Cys Val Phe Leu Val Lys Ser Gin 61y 
20 25 30 

Val Asn Asp Asn Glu Glu Gly Phe Phe Ser Ala Arg Gly His Arg Pro 
35 40 45 

Leu Asp Lys Lys Arg Glu G1u Ala Pro Ser Leu Arg Pro Ala Pro Pro 
50 55 60 

Pro He Ser Gly Gly Gly Tyr Arg Ala Arg Pro Ala Lys Ala Ala Ala 
65 70 75 80 

Thr Gin Lys Lys Val Glu Arg Lys Ala Pro Asp Ala Gly Gly Cys Leu 
85 90 95 

His Ala Asp Pro Asp Leu Gly Val Leu Cys Pro Thr Gly Cys Gin Leu 
100 105 110 
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61 n 61 u Ala Leu Leu 61 n 61 n 61 u Arg Pro He Arg Asn Ser Val Asp 
115 120 125 

61 u Leu Asn Asn Asn Val 61 u Ala Val Ser 61 n Thr Ser Ser Ser Ser 
130 135 140 

Phe 61 n Tyr Met Tyr Leu Leu Lys Asp Leu Trp 61 n Lys Arg 61 n Lys 
145 150 155 160 



61n Val Lots Asp Asn 61u Asn Val Val Asn 61u Tyr Ser Ser 61u Leu 
165 170 175 

61u Lys His 61n Leu Tyr He Asp 61u Thr Val Asn Ser Asn He Pro 
180 185 190 

Thr Asn Leu Arg Val Leu Arg Ser He Leu 61 u Asn Leu Arg Ser Lys 
195 200 205 

He 61n Lys Leu 61 u Ser Asp Val Ser Ala 61 n Met 61 u Tyr Cys Arg 
210 215 220 

Thr Pro Cys "^hr Val Ser Cys Asn He Pro Val Val Ser 61y l^s 61 u 
225 230 235 240 

Cys 61u 61u He He Arg Lys 61y 61y 61u Thr Ser 61u Met Tyr Leu 
245 250 255 

He 61 n Pro Asp Ser Ser Val Lys Pro Tyr Arg Val Tyr Cys Asp Met 
260 265 270 

Asn Thr 61 u Asn 61y 61y Trp Thr Val He 61 n Asn Arg 61 n Asp 61y 
275 280 285 

Ser Val Asp Phe 61y Arg lys Trp Asp Pro Tyr Lys 61 n 61y Phe 61y 
290 295 300 

Asn Val Ala Thr Asn Thr Asp 61 y Lys Asn Tyr Cys 61 y Leu Pro 61y 
305. 310 315 320 

61 u Tyr Trp Leu 61y Asn Asp Lys He Ser 61 n Leu Thr Arg Met 61y 
325 330 335 

Pro Thr 61 u Leu Leu He 61 u Met 61 u Asp Trp Lys 61 y Asp Lys Val 
340 345 350 

Lys Ala His Tyr 61y 61y Phe Thr Val 61 n Asn 61 u Ala Asn Lys Tyr 
355 360 365 
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Gin He Ser Val Asn Lys Tyr Arg Gly Thr Ala Gly Asn Ala Leu Met 
370 375 380 

Asp Gly Ala Ser G1n Leu Met Gly Glu Asn Arg Thr Met Thr He His 
385 390 395 400 

Asn Gly Met Phe Phe Ser Thr Tyr Asp Arg Asp Asn Asp Gly Trp Leu 
405 410 415 

Thr Ser Asp Pro Arg Lys Gin Cys Ser Lys Glu Asp Gly Gly Gly Trp 
420 425 430 

Trp Tyr Asn Arg Cys His Ala Ala Asn Pro Asn Gly Arg Tyr Tyr Trp 
435 440 445 

Gly Gly Gin Tyr Thr Trp Asp Met Ala Lys His Gly Thr Asp Asp Gly 
450 455 460 

Val Val Trp Met Asn Trp lys Gly Ser Trp Tyr Ser Met Arg Lys Met 
465 470 475 480 

Ser Met lys He Arg Pro Phe Phe Pro Gin Gin 
485 490 



(2) INFORMATION FOR SEQ ID N0:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10564 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: human fibrinogen gamma chain 

(ix) FEATURE: 

(A) NAHEAEY: CDS 

(B) LOCATION: join(1799..1876, 1973.. 2017, 2207.. 2390, 2510 

..2603, 4211.. 4341, 4645.. 4778, 5758.. 5942, 7426 
..7703, 9342.. 9571) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

CTACACACn CTTGAAGGCA AAGGCAATGC TGAAGTCACC mCATGTTC AAATCATATT 60 

AAAAAGHAG CAAGATGTAA HATCAGIGT ACTATGTAAA TCTTTGTGAA TGATCAATAA 120 

nACATATTT TCAHATATA TATTTTAGTA GATAATATTT ATATACATTC AACAHCTAA 180 

ATATAGAAAG TTTACA6AGA AAAATAAAGC CUM II ICC AATCCTGTCC TCCACCTCT6 240 

CATCCCAHC nCTTCACAG AGGCAACT6A HCAAGTCAT TACATAGHA HGAGIGHA 300 

ACTACAACTA TGHAAGTAC AGCTATATAT GHAGATGCC GTAGCCACAG AAATCAGTTT 360 

ACAATCTAAT 6CAGTGGATA CAGCATGTAT ACATATAATA TAAGGHGCT ACAAATGCTA 420 

TCTGAGGTA6 AGCTGTTTGA AAGAATACTA ATACHAAAT GTTTAAnCA ACTGACHGA 480 

HGACAACTG ATTAGCTGAG TGGAAAAGAT GGATGAGAAA GAHGTGAGA CHAAHGGC 540 

TGGTGGTATG GTGATAT6AT TGACAATAAC TGCTAAGTCA 6AGAGGGATA TAHAAGGAG 600 

GAGAAGAAAA GCAACAAATC TGGTTTTGAT GTGnCACTT TGHATAAn AnGATTATT 660 

TACTGAATAT GAATATTTAT CnTGTTTTT 6A6TCAATAA ATATACCTTT GTAAAGACA6 720 

AATTAAAGTA nAGTATTTC TTTCAAACTG GAGGCATTTC TCCCACTAAC ATATTTCATC 780 

AAAACHATA ATAAGCTTGG HCCAGAGGA AGAAATGAGG GATAACCAAA AATAGAGACA 840 

TTAATAATAG TGTAACGCCC AGTGATAAAT CTCAATA6GC AGTGATGACA GACATGTTTr 900 

CCCAAACACA AGGATGCTGT AAGGGCCAAA CAGAAATGAT GGCCCCTCCC CAGCACCTCA 960 

TTTTGCCCCT TCCTTCAGCT ATGCCTCTAC TCTCCTTTAG ATACAAGGGA GGTGGATTTT 1020 

TCTCnCTCT GAGATAGCTT GATGGAACCA CAGGAACAAT GAAGTGGGCT CCTGGCTCH 1080 

nCTCTGTGG CAGATGGG6T GCCATGCCCA CCHCAGACA AAGGGAAGAT TGAGCTCAAA 1140 

AGCTCCCTGA GAAGTGAGAG CCTAT6AACA TGGnGACAC AGAGGGACAG GAAIGTATH 1200 

CCAGGGTCAT TCAnCCTGG GAATAGTGAA CT6GGACATG GGGGAAGTCA GTCTCCTCCT 1260 

GCCACAGCCA CAGAHAAAA ATAATAATGT TAACTGATCC CTAGGCTAAA ATAATAGTGT 1320 

TAACTGATCC CTAAGCTAAG AAAGnCTTT IGGTAAHCA GGTGATGGCA GCAGGACCCA 1380 
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TCTTAAGGAT 


AGACTAGGTT 


TGCTTAGTTC 


CAE6TCATAT 


rTCTTTcrTr TrAcrrATCT 

u 1 a 1 1 1 uU 1 i» 1 LHuLUH 1 u 1 


1 AAO 


ACTGGAAGAA 


6TT6CATCAC 


ACAGCCTCCA 


ccArTcrrrT 

DOnU 1 QULU 1 


prTrrTrACA crAATcraT* 

UUILulUMLM bUMAiblaAIA 


1 CAA 
1500 


ATGCTTCACT 


AfiCCTTTGCA 


fiATAATTTTft 
Oninn 1 1 1 i u 




AAAArpTTCA crTrcrmiA 


t CCA 


AAAGGAGGA6 


CnCAACCTG 


TGTGCAAAAT 


CT6GGAACCT 


GACAGTATAG GnGGGGGCC 


1620 


AGGAT6AGGA 


AAAAGGAAC6 


GGAAA6ACCT 


GCCCACCCn 


CT6GTAAGGA 6GCCCCGTGA 


1680 


TCAGCTCCAG 


CCATTTGCAG 


TCCTGGCTAT 


CCCAGGAGCT 


TACATAAAGG GACAAHGGA 


1740 


GCCTGAGA6G 


TGACAGTGCT 


GACACTACAA 


GGCTCGGAGC 


TCCGGGCACT CAGACATC 


1798 



ATG AGT TGG TCC HG CAC CCC CGG AAT TTA AH CTC TAC HC TAT GCT 1846 
Met Ser Trp Ser Leu His Pro Arg Asn Leu He Leu Tyr Phe Tyr Ala 
15 10 15 



CTT HA rrr CTC TCT TCA ACA TGT GTA GCA GTAAGTGTGC TCTTCACAAA 1896 
Leu Leu Phe Leu Ser Ser Thr Cys Va1 Ala 
20 25 

ACGnGTTTA AAATGGAAAG CTGGAAAATA AAACAGATAA TAAACTAGTG AAATTHCGT 1956 

AiiiillCTC TTTTAG TAT GH GCT ACC AGA GAC AAC TGC TGC ATC TTA 2005 
Tyr Val Ala Thr Arg Asp Asn Cys Cys He Leu 
30 35 

GAT GAA AGA TTC GTAAGTAGTT TrTATGTTTC TCCCTTTGTG TGTGAACTGG 2057 
Asp Glu Arg Phe 
40 

AGAGGGGCAG AGGAATAGAA ATAATTCCCT CATAAATATC ATCTGGCACT TGTAACTTTT 2117 

TAAAAACATA GTCTAGGTTT TACCTATTTT TCHAATAGA TTTTAAGAGT AGCATCTGTC 2177 

TACATTTTTA ATCACTGHA TATTTTCAG 6GT AGT TAT TGT CCA ACT ACC TGT 2230 

Gly Ser Tyr Cys Pro Thr Thr Cys 
45 



GGC AH GCA GAT HC CTG TCT ACT TAT CAA ACC 
Gly lie Ala Asp Phe Leu Ser Thr Tyr Gin Thr 
50 55 60 



AAA GTA GAC AA6 GAT 
Lys Val Asp Lys Asp 
65 



2278 
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CTA CAG TCT HG GAA GAC ATC HA CAT CAA GH GAA AAC AAA ACA TCA 
Leu Gin Ser Leu Glu Asp He Leu His Gin Val Glu Asn Lys Thr Ser 
70 75 80 

GAA GTC AAA CAG CTG ATA AAA GCA ATC CAA CTC ACT TAT AAT CCT GAT 
Glu Val Lys Gin Leu He Lys Ala He Gin Leu Thr Tyr Asn Pro Asp 
85 90 95 

GAA TCA TCA AAA CCA A 6TGAGAAAAT AAAGACTACT GACCAAAAAA 
Glu Ser Ser Lys Pro 
100 

TAATAATAAT AATCTGTGAA GTTCTTTTGC TGTTGTTTTA GHGTrCTAT TTGCnAAGG 

ArmTATGT CTCTGATCCT ATAHACAG AT ATG ATA GAC GCT GCT ACT TTG 

Asn Net lie Asp Ala Ala Thr Leu 
105 110 

AAG TCC AGG ATA ATG HA GAA GAA AH ATG AAA TAT GAA GCA TCG AH 
Lys Ser Arg He Net Leu Glu Glu He Met Lys Tyr Glu Ala Ser He 





115 




120 




125 


TTA ACA CAT GAC TCA AGT AH CG 
Leu Thr His Asp Ser Ser He Arg 
130 


GTAAGGATTT nGTTTTAAT 


TTGCTCTGCA 


AGACTGATTT 


AGTTTnATT 


TAATAHCTA 


TACHGAGTG AAAGTAATTT 


HAATGTGH 


TTCCCCATH 


ATAATATCCC 


AGTGACATTA 


TGCCTGATTA 


TGHGAGCAl 


AGTAGAGATA 


GAAGTTTTTA 


GTGCAATATA 


AAHATACTG 


GGHATAATT 


GCHAHAAT 


AATCACAHG 


AAGAAAGATG 


nCTAGATGT 


CTTCAAATGC 


TAGTTTGACC 


ATATTTATCA 


AAAATTTTTT 


CCCCATCCCC 


CATTTATCTT 


ACAACATAAA 


ATCAATCTCA 


TAGGAATTTG 


GGTGTTGAAA 


ATAAAATCCT 


CTTTATAAAA 


ATGCTGACAA 


ATTGGTGGTT 


AAAAAAAHA 


GCAAGCAGAG 


GCATAGTAAG 


GATTrTGGCT 


CCTAAAGTAA 


AHATAHGA 


ATGTGGAGCA 


GGAAGAAACA 


TGTCTTGAGA 


GACTAAGTGT 


GGCAAATAH 


GCAAAGCTCA 


TAHGATCAT 


TGCAGAATGA 


ACCTGCATAG 


TCTCTTCCCT 


TCATTTG6AA 


GTGAATGTCT 


CTGTTAAAGC 


nCTCAGGGA 


CTCATAAACT 


nCTGAACAT 


AAGGTCTCAG 


ATACAGTTn 


AATAI M 1 IC 


CCCAAI 1 1 1 1 


TTTTCTGAAT 


TTTTCTCAAA 


GCAGCHGAG 


AAATTGAGAT 


AAATAGTAGC 


TAGGGAGAAG 
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TGGCCCAGGA AAGATTTCTC CTCTTTTTGC TATCAGAGGG CCCTT6TTAT TAHGHAn 

AHAHACn GCAmnAT TGTCCATCAT TGAAGHGAA GGAGGHAH GTACAGAAAT 

TGCCTAAGAC AAGGTAGASG GAAAACGTGG ACAAATAGTT T6TCTACCCT TTnTTACnC 3473 

AAAGAAAGAA CGGTTTATGC AHGTAGACA GTmCTATC ATTTnGGAT ATTTGCAAGC 3533 

CACCCTGTAA GTAACTACAA AAGGAGGGH TTTACmCC CCAGTCCATT CCCAAAGCTA 3593 

TGTAACCAGA AGCATTAAAG AAGAAAG6GG AAGTATCTGT TGTTrTATTr TACATACAAT 3653 

AACGHCCAG ATCAT6TCCC TGTGTAAGH ATATTTTAGA TTGAAGCnA TAT6TATAGC 3713 

CTCAGTAGAT CCACAAGTGA AA6GTATACT CCHCAGCAC ATGTGAAHA CTGAACTGA6 3773 

CTTTTCCTGC HCTAAAGCA TCAGGGGGTG mCTAHAA CCAGTCTCGC CACTCTTGCA 3833 

GGTTGCTATC TGCTGTCCCT TATGCATAAA GTAAAAAGCA AAATGTCAAT GACATTTGCT 3893 

TATTGACAAG GACTTTGm TTTGTGTT6G GAGTTGAGAC AATATGCCCC ATTCTAAGTA 3953 

AAAAGAHCA GGTCCACATT GTATTCCTGT TTTAATTGAT TTTTTGATTT GmTTCTTT 4013 

nCAAAAAGT HATAATTn AAnCATGH AATTTAGTAA TATAAmTA CATTTTCCTC 4073 

AAGAATGGAA TAATTTATCA GAAAGCACTT CTTAAGAAAA TACHAGCAG TTTCCAAAGA 4133 

AAATATAAAA TTACTCnCT GAAAG6AATA CnATTTrTG TCTTCnATT TTTGTTATCT 4193 

TATGTTTCTG TTTGTAG A TAT TTG CAG GAA ATA TAT AAT TCA AAT AAT CAA 4244 

Tyr Leu Gin Glu He Tyr Asn Ser Asn Asn Gin 
135 140 145 

AAG ATT GH AAC CTG AAA GAG AA6 GTA GCC CAG CH GAA GCA CAG TGC 4292 
Lys He Va1 Asn Leu Lys Glu Lys Val Ala Gin Leu Glu Ala Gin Cys 
150 155 160 

CAG GAA CCT TGC AAA GAC ACG 6TG CAA ATC CAT GAT ATC ACT 6GG AAA G 4341 
Gin Glu Pro Cys Lys Asp Thr Val Gin He His Asp He Thr Gly Lys 
165 170 175 

GTAACTGATG AAGGTTATAT TGGGATTAGG TTCATCAAAG TAAGTAATGT AAAGGAGAAA 4401 

GTATGTACTG 6AAAGTATAG GAATAGTTTA GAAAGTGGCT ACCCATTAAG TCTAAGAAH 4461 
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TCASHGTCT AGACCTTTCT T6AATAGCTA AAAAAAACAG TTTAAAAGGA ATGCTGATGT 4521 

6AAAAGTAAG AAAAnATTC HGGAAAATG AATAGTTTAC TACATGTTAA AAGCTATTFT 4581 

TCAA6GCTGG CACAGTCTTA CCTGCATTTC AAACCACAGT AAAAGTCGAT TCTCCTOTC 4641 

TAG AT TGT CAA 6AC ATT 6CC AAT AA6 6GA GCT AAA CAG AGC GGG CTT 4688 
Asp Cys G1n Asp He Ala Asn Lys 61y Ala Lys Gin Ser Gly Leu 
180 185 190 

TAC TTT AH AAA CCT CTG AAA GCT AAC CAG CAA TfC TTA 6TC TAC TGT 4736 
Tyr Phe He Lys Pro Leu Lys Ala Asn Gin Gin Phe Leu Val Tyr Cys 
195 200 205 

GAA ATC GAT GGG TCT GGA AAT GGA TGG ACT GTG TTT CAG AAG 4778 
Glu lie Asp Gly Ser Gly Asn Gly Trp Thr Val Phe Gin Lys 



210 




215 


220 






GTAAI i 1 1 1 1 


CCCCACCATG 


TCTATTTAAT 


AAAnCCTAC AnGTTTCTG 


CCATATGGCA 


4838 


GATACTTTTC 


TAAGCACCTT 


GTGAACCGTA 


GCTCATTTAA TCCTTGCAAT 


AGCCCTAAGA 


4898 


GGAAGGTACT 


TCTGTTACTC 


CTATTTACAS 


AAAAGGAAAC TGAGGCACAC 


AAGGHAAAT 


4958 


AACTTGCCCA 


AGACCACATA 


ACTAATAAGC 


AACAGAGTCA GCATTTGAAC 


CTAGGCAGTA 


5018 


TAGTTTCAGA 


GTTTGTGACT 


TGACTCTATA 


HGTACTGGC ACTGACTTTG 


TAGATTCATG 


5078 


GTGGCACATA 


ATCATAGTAC 


CACAGTGACA 


AATAAAAAGA AGGAAACTCT 


TTTGTCAGGT 


5138 


AGGTCAAGAC 


CTGAGGTTTC 


CCATCACAAG 


ATGAGGAAGC CCAACACCAC 


CCCCCACCAC 


5198 


CCCACCACCA 


TCACCACCCT 


nCACACACC 


AGA6GATACA CHGGGCTGC 


TCCAAGACAA 


5258 


GGAACCT6TG 


TTGCATCTGC 


CACHGCTGA 


TACCCACTAG GAATCHGGC 


TCCTTTACn 


5318 


TCTGTTTACC 


TCCCACCACT 


GHATAACTG 


TTTCTACA6G GG6CGCTCA6 


A6GGAATGAA 


5378 


TGGTGGAAGC 


ATTAGHGCC 


AGACACCGAT 


TGAGCAATGG GTTCCATCAT 


AAGTGTAAGA 


5438 


ATCAGTAATA 


TCCAGCTAGA 


GHCTGAAGT 


CGTCTAGGTG TCTTTHAAT 


AHACCACTC 


5498 


ATTTAGAAH 


TATGATGT6C 


CAGAAACCCT 


CHAAGTAH TCTCTTATAT 


TCTCTCTCAT 


5558 


GATCCTTGCA 


GCAACCCTAA 


GAAGTAACCA 


TCAIIIIICC TATHGATAC 


AT6AGGAAAC 


5618 


TGAGGTAGCT 


TG6CCAAGAT 


CACHAGTTG 


GGAGHGATA GAACCAGTGC 


TCTGTATTTT 


5678 
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T6ACAAAATG HGACAGCAT TCTCTTTACA TGCAHGATA GTCTATmC TCCTTTTGCT 5738 

CHGCAAATG IGTAAHAG AGA CH GAT GGC AGT GTA GAT TfC AAG AAA AAC 5790 

Arg Leu Asp Gly Ser Val Asp Phe Lys Lys Asn 
225 230 

TGG AH CAA TAT AAA GAA 6GA TTT GGA CAT CTG TCT CCT ACT GGC ACA 5838 
Trp He Gin Tyr Lys Glu Gly Phe Gly His Leu Ser Pro Thr Gly Thr 
235 240 245 

ACA GAA TTT TGG CTG GGA AAT GAG AAG ATT CAT HG ATA AGC ACA CAG 5886 
Thr Glu Phe Trp Leu Gly Asn Glu lys He His Leu He Ser Thr Gin 
250 255 260 265 

TCT GCC ATC CCA TAT GCA HA AGA GTG GAA CTG GAA GAC TGG AAT GGC 5934 
Ser Ala He Pro Tyr Ala Leu Arg Val Glu Leu Glu Asp Trp Asn Gly 
270 275 280 

AGA ACC AG GTACTGTTTT GAAATGACH CCAACTTTn AHGTAAAGA 5982 
Arg Thr Ser 



TTGCCTGGAA TGTGCACTTT CCAACTATCA ATAGACAATG GCAAATGCAG CCTGACAAAT 6042 

GCAAACAGCA CATCCAGCCA CCATTTTCTC CAGGAGTCTG TnGGTTCn GGGCAATCCA 6102 

AAAAGGTAAA TTCTAHCAG GATGAATCTA AGTGTAHGG TACAATCTAA TTACCCTGGA 6162 

ACCATTCAGA GTAATAGCTA AHACTGAAC TTTrAATCAG TCCCAGGAAT TGAGCATAAA 6222 

AnATAATTT TATCTAGTCT AAAHACTAT HCATGAAGC AGGTAHATT AHAATCCCA 6282 

TTHATAGAT TAACHGCTC AAAGTCACAT TGCTGATAAG TGGTAGAGGT AGAAHCAGA 6342 

CTCAAGTAGT nAACTTTAG AGCCTGTCCT CTTAACAACT ATCCTGGHG AAAAGCAAAT 6402 

ACAGCCTCn CAGACTTCTC AGTGCCTTGA TGGCCATTTA HCTGTCAAA TCATGAGCTA 6462 

CCCTAAAAGT AAACCAGCTA GCTCTTTTGA TGATCTAGAG GCTTCTTTTT GCTTGAGATA 6522 

TnGAAGGTT TTAAGCAHG HACCTAAH AAAATGCAGA AAAATATCCA ACCCTCTTGT 6582 

TATGrrTAAG GAATA6TGAA ATAJAHGTC HCAAACACA TGGACTTTTT TTTATTGCn 6642 

GGHGGTTn TAATCCAGAA AGTGCTATAG TCAGTAGACC HCTTCTAGG AAAGGACCTT 6702 
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CCATTTCCCA GCCACTB6AG ATTAGAAAAT AAGCTAAATA TTTTCTGGAA ATTTCTGnC 6762 

AHCAHAAG 6CCCATCCTT TCCCCCACTC TATAGAAGTG TTGTCCACn GCACAATTTT 6822 

nCCAGGAAA GAATCTCTCT AACTCCHCA GCTCACATGC TTTGGACCAC ACAGG6AAGA 6882 

CTTTGAnGT GTAATGCCCT CAGAAGCTa CCTTCTTGCC ACTACCACAC TGATTTGAGG 6942 

AAGAAAATCC CTTTAGCACC TAACCCTTCA GGTGCTATGA GTGGCTAATG GAACTGTACC 7002 

TCCTTCAAGT nTGTGCAAT AATTAAGGGT CACTCACTGT CAGATACTTT CTGTGATCTA 7062 

TGATAATGTG TGT6CAACAC ATAACATTTC AATAAAAGTA GAAAATATGA AAHAGAGTC 7122 

ATCTACACAT CTGGATTT6A TCHAGAATG AAACAAGCAA AAAAGCATCC AAGTGA6TGC 7182 

AAHAHAGT TTTCAGAGAT GCHCAAAGG CTTCTAGGCC CATCCCGGGA AGTGHAATG 7242 

AGCTGTGGAC TGGTTCACAT ATCTATTGCC TCHGCCAGA TTTGCAAAAA ACHCACTCA 7302 

ATGAGCAAAT HCAGCCTTA AGAAACAAAG TCAAAAAHC CAAGGAA6CA TCCTACGAAA 7362 

GAGGGAACTT CTGAGATCCC TGAGGAGGGT CAGCATGTGA TGGTTGTATT TCCnCTTCT 7422 

CAG T ACT GCA GAC TAT GCC ATG TTC AAG GTG GGA CCT GAA GCT GAC 7468 
Thr Ala Asp Tyr Ala Met Phe Lys Val Gly Pro Glu Ala Asp 
285 290 295 

AAG TAC CGC CTA ACA TAT GCC TAC HC GCT GGT GGG GAT GCT GGA GAT 7516 
Lys Tyr Arg Leu Thr Tyr Ala Tyr Phe Ala Gly Gly Asp Ala Gly Asp 
300 305 310 

GCC TTT GAT GGC TH GAT TTT 6GC GAT GAT CCT AGT GAC AAG TTT HC 7564 
Ala Phe Asp Gly Phe Asp Phe Gly Asp Asp Pro Ser Asp Lys Phe Phe 
315 320 325 330 

ACA TCC CAT AAT GGC ATG CAG TTC AGT ACC TGG GAC AAT GAC AAT GAT 7612 
Thr Ser His Asn Gly Met Gin Phe Ser Thr Trp Asp Asn Asp Asn Asp 
335 340 345 

AAG TTT GAA GGC AAC TGT GCT GAA CAG GAT GGA TCT GGT TGG TGG ATG 7660 
Lys Phe Glu Gly Asn Cys Ala Glu Gin Asp Gly Ser Gly Trp Trp Met 
350 355 360 

AAC AAG TGT CAC GCT GGC CAT CTC AAT GGA GH TAT TAC CAA G 7703 
Asn Lys Cys His Ala Gly His Leu Asn Gly Val Tyr Tyr Gin 
365 370 375 
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GTAT6TTTTC CTTTCnAGA TTCCAAGTTA 
TAATAAATA6 ATAT6AAGAA ATGAAGAATA 
nCTTTATTT CAACTAAGH CTTTGAAACT 
AAHAGCCCT TCTAAAGAAA TCCACCTGCT 
TGAAACTATC ACCATCATAG CT6GCACTGT 
ATAAATGATC TGGATGGATT AACATTACTA 
mCCATGAG CTTTGTGTCA GATACACGCA 
6A6AAG6CAA GCACTGGTAA GGGCAGGG6T 
GAGGACAGTA GACACTTATT HAGGATGGG 
AGCHGGAAT GGTTTGGAAC ACTGGTTTCA 
GCOTACCGA TGCTAAA66A TCCAT6TTAC 
TATTCCATGA ATAAAACCAC TATGAAGATA 
GGACAGCAAC ACCACCCTGG GAAAGCCAAA 
GGACmCCT ACCATATCTG CTGGGTAGAT 
GCTTGTAGCA AACCATAGTC TCCTCATCTA 
CTAGCCAATC ACCAACTAGG AAACTTTGCA 
ACAATATTTT TAAAHGGAG AAACATAACT 
AAGAAACTCT AAGGGAAAAT GCTTGATCTG 
GHCATGCCA GT6TTGTGCT CTGACAAGCC 
TAGGACAAGG AGAAGGCAAA TCAACA6AGG 
ATGCCAGAAC CATTTCCTAA AGCTCCTCAA 
CTGA6TT6AC CTCATCAGGA ATTTTGTGGC 
T6GTCCACAG CA6CTGAGTC CGGGATG6AT 



PCT/DS95/02648 

65 

ATGTATAGTG TATACTATTT TCATAAAAAA 7763 

ATTTATAAAG ATAGTAG66A TTTTATCATG 7823 

6GAAGTGGAT AATACCAAGT TCATGCCTAA 7883 

GCAAAATATC CAGTAGITTG GCAHATATG 7943 

6GGTTGTGGG ATCTCCTTTA GACATACAAC 8003 

CAT6GATGCT TGTTGACACA nAACCTGGC 8063 

GTGAACAGGT GTTTGGAGGA ACAGAATAAA 8123 

nCTGAAAGC TTGAGAGAA6 A6ACCAGTCT 8183 

GGHGGATGA GGAGGCTATA GrTTGCTATA 8243 

CTCACCTACC CAGCAGHAT GTGTGGGGAA 8303 

AATAATGGCA TTATnGGAA ATCCCAGTGG 8363 

ATCCCACTCA ACA6ACTCTC CGTTGGAGAA 8423 

CAGTCAGACC AGACCTGTTT AGCATCAGTA 8483 

GAGTGAAACC AGTGHCCAA ACCACTCCGG 8543 

CCAAGATGAG CAACCTTACC TCCTGATGTC 8603 

CAGTTTATrr AAAGTAACAG mGATTTTC 8663 

TATCTHGCA CTCACAAACC ACATAATGAG 8723 

TGTGACCCGG GGCGCCATGC CAGAGCTGTA 8783 

TTTTACAGAA TTACATGAGA TCTGCTTCCC 8843 

CTGCACTTTA AAATGGAGAC ATAAAATAAC 8903 

TCAACCAACA AAATTGTGCT HCAAATAAC 8963 

TCCnCTCTT CTAACCTGCC TGAAGAAAGA 9023 

AAGCTTAGGG ACAGA6GCCA ATTAGGGAAC 9083 
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TrrSGGTTTC TASCCCTACT A6TAST6AAT AAATTTAAAG TSTGGATGTG ACTATGAGTC 9143 

ACAGCACAGA TGnGITTAA TAATATGTTT AnTTATAAA nGATATTTT AGGAATCTTT 9203 

GGAGATATTT TCAGTTAGCA GATAATACTA TAAATniAT GTAACTGGCA ATGCACHCG 9263 

TAATAGACAG CTCTTCATAG ACTTGCAGAG GTAAAAAGAT TCCAGAATAA TGATAT6TAC 9323 

ATCTAC6ACT TGTTTTAG GT 6GC ACT TAC TCA AAA GCA TCT ACT CCT AAT 9373 

Gly Gly Thr Tyr Ser lys Ala Ser Thr Pro Asn 
380 385 

GGT TAT GAT AAT GGC ATT ATT TGG GCC ACT TGG AAA ACC CGG TGG TAT 9421 
Gly Tyr Asp Asn Gly lie He Trp Ala Thr Trp Lys Thr Arg Trp Tyr 
390 395 400 

TCC ATG AAG AAA ACC ACT AT6 AAG ATA ATC CCA HC AAC AGA CTC ACA 9469 
Ser Met Lys Lys Thr Thr Met Lys He He Pro Phe Asn Arg Leu Thr 
405 410 415 

AH GGA GAA GGA CAG CAA CAC CAC CTG GGG GGA GCC AAA CAG GTC AGA 9517 
He Gly Glu Gly Gin Gin His His Leu Gly Gly Ala Lys Gin Val Arg 
420 425 430 435 

CCA GAG CAC CCT GCG 6AA ACA GAA TAT GAC TCA CTT TAC CCT GAG GAT 9565 
Pro Glu His Pro Ala Glu Thr Glu Tyr Asp Ser Leu Tyr Pro Glu Asp 
440 445 450 

GAT HG TAGAAAATTA ACTGCTAACT TCTAHGACC CACAAAGTIT CAGAAAHCT 9621 
Asp Leu 

CTGAAAGTTT CTTCCTTTTT TCTCHACTA TATTTAnGA TTTCAAGTCT TCTAHAAGG 9681 

ACATTTAGCC TTCAATGGAA ATTAAAACTC ATTTAGGACT GTATTTCCAA ATTACTGATA 9741 

TCAGAGTTAT TTAAAAAHG TTTATTTGAG GAGATAACAT TTCAACTTTG mCTAAATA 9801 

TATAATAATA AAATGAHGA CTTTATTTGC ATTTTTATGA CCACHGTCA TTTATTnGT 9861 

CnCGTAAAT TATTTTCATT ATATCAAATA TTTTAGTATG TACHAATAA AATAGGAGAA 9921 

CATTTTAGAG TTTCAAAnC CCAGGTATTT TCCTTGTrTA HACCCCTAA ATCAHCCTA 9981 

TTTAAnCTT CTTTnAAAT GGAGAAAAH ATGTCTTTn AATATGGTTT nGTHTGn 10041 

ATATAHCAC AGGCT6GA6A CGTTTAAAAG ACCGTITCAA AAGAGATTTA C M IIIIAAA 10101 
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GGACnTATC TGAACA6A6A 6ATATAATAT TTTTCCTAn 66ACAATGGA CHGCAAAGC 10161 

nCACnCAT TTTAAGAGCA AAAGACCCCA TGHGAAAAC TCCATAACA6 mTATGCTG 10221 

ATGATAATH ATCTACATGC ATTTCAATAA ACCTTTTGTT TCCTAAGACT AGATACAT6G 10281 

TACCTTTAn GACCAHAAA AAACCACCAC TTTTTGCCAA nTACCAATT ACAAHGGGC 10341 

AACCATCAGT AGTAATT6AG TCCTCATTTT ATGCTAAATG HATGCCTAA CTCTTTGGGA 10401 

GHACAAAGG AAATAGCAAT TATGGCTTTT GCCCTCTAGG AGATACAGGA CAAATACAGG 10461 

AAAATACAGC AACCCAAACT GACAATAaC TATACAAGAA CATAATCACT AAGCAGGAGT 10521 

CACAGCCACA CAACCAAGAT GCATAGTATC CAAAGT6CAG CTG 10564 

(2) INFORMATION FOR SEQ ID N0:6: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 453 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: protein 

(x1) SEQUENCE DESCRIPTION: SEQ ID N0:6: 

Met Ser Trp Ser Leu His Pro Arg Asn Leu He Leu Tyr Phe Tyr Ala 
15 10 15 

Leu Leu Phe Leu Ser Ser Thr Cys Val Ala Tyr Val Ala Thr Arg Asp 
20 25 30 

Asn Cys Cys He Leu Asp 61 u Arg Phe Gly Ser Tyr Cys Pro Thr Thr 
35 40 45 

Cys Gly He Ala Asp Phe Leu Ser Thr Tyr Gin Thr Lys Val Asp Lys 
50 55 60 

Asp Leu Gin Ser Leu Glu Asp He Leu His Gin Val Glu Asn Lys Thr 
65 70 75 80 

Ser Glu Val Lys Gin Leu He Lys Ala He Gin Leu Thr Tyr Asn Pro 
85 90 95 
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Asp Glu Ser Ser Lys Pro Asn Het lie Asp Ala Ala Thr Leu Lys Ser 
100 105 110 

Arg He Met Leu Glu Glu He Met lys Tyr Glu Ala Ser He Leu Thr 
115 120 125 

His Asp Ser Ser He Arg Tyr Leu Gin Glu He Tyr Asn Ser Asn Asn 
130 135 140 

Gin Lys He Val Asn Leu Lys Glu Lys Val Ala Gin Leu Glu Ala Gin 
145 150 155 160 

Cys Gin Glu Pro Cys Lys Asp Thr Val Gin He His Asp He Thr Gly 
165 170 175 

Lys Asp Cys Gin Asp He Ala Asn Lys Gly Ala Lys Gin Ser G1y Leu 
180 185 190 

Tyr Phe He Lys Pro Leu Lys Ala Asn G1n Gin Phe Leu Val Tyr Cys 
195 200 205 

Glu He Asp Gly Ser Gly Asn Gly Trp Thr Val Phe Gin Lys Arg Leu 
210 215 220 

Asp Gly Ser Val Asp Phe Lys Lys Asn Trp He Gin Tyr Lys Glu Gly 
225 230 235 240 

Phe G(y His Leu Ser Pro Thr Gly Thr Thr Glu Phe Trp Leu Gly Asn 
245 250 255 

Glu Lys He His Leu He Ser Thr Gin Ser Ala He Pro Tyr Ala Leu 
260 265 270 

Arg Val Glu Leu Glu Asp Trp Asn Gly Arg Thr Ser Thr Ala Asp Tyr 
275 280 285 

Ala Met Phe Lys Val Gly Pro Glu Ala Asp Lys Tyr Arg Leu Thr Tyr 
290 295 300 

Ala Tyr Phe Ala Gly Gly Asp Ala Gly Asp Ala Phe Asp Gly Phe Asp 
305 310 315 320 

Phe Gly Asp Asp Pro Ser Asp Lys Phe Phe Thr Ser His Asn Gly Het 
325 330 335 

Gin Phe Ser Thr Trp Asp Asn Asp Asn Asp Lys Phe Glu Gly Asn Cys 
340 345 350 
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Ala 61 u Gin Asp Gly Ser Gly Trp Trp Met Asn Lys Cys His Ala Gly 
355 360 365 

His Leu Asn Gly Val Tyr Tyr Gin Gly Gly Thr Tyr Ser Lys Ala Ser 
370 375 380 

Thr Pro Asn Gly Tyr Asp Asn Gly He He Trp Ala Thr Trp Lys Thr 
385 390 395 400 

Arg Trp Tyr Ser Met Lys Lys Thr Thr Met Lys He He Pro Phe Asn 
*05 410 415 

Arg Leu Thr He Gly Glu Gly Gin Gin His His Leu Gly Gly Ala Lys 
420 425 430 

Gin Val Arg Pro Glu His Pro Ala Glu Thr Glu Tyr Asp Ser Leu Tyr 
435 440 445 

Pro Glu Asp Asp Leu 
450 

(2) INFORMATION FOR SEQ ID N0:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(vli) IMMEDIATE SOURCE: 

(8) CLONE: ovine beta-lactoglobulin 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:7: 

ACGCGTGTCG ACCTGCAGGT CAACGGATCT CTGTGTCTGT TTTCATGnA GTACCACACT 60 

GTTHGGTGG CTGTAGCTTT CAGCTACAGT CTGAAGTCAT AAAGCCTG6T ACCTCCAGCT 120 

CTGHCTCTC TCAAGATTGT GTTaGCTGT TTGGGTCTTr A6TGTCTCCA CACAATTTTT 180 

A6AATTGTTT GTTCTAGnC TGTGAAAAAT GATGCTGGTA TTTTGATAAG GATTGCAHG 240 

AATCTGTAAA GCTACAGATA TAGTCAHGG GTAGTACAGT CACTTTAACA ATAHAACTC 300 
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nCACATCTG TSAGCATGAT ATATTTTCCC CCTCTATATC ATCnCAATT CCTCCTATCA 
GmCTnCA TTGCAGTTrr CTGA6TACA6 GTCHACACC TCCnGGHA GAGTCAHCC 
TCAGTATTTT AncCTTTGA TACAAHCTG AATGAG6TAA TmcnAGT nCTCTTTCT 
6ATAGCTCAT TGTTAGT6TA TATATAGAAA AGCAACAGAT TTCTATGTAT TAATTTT6TA 
TCCTGCAACA GATTTCTATG TAnAATTn GTATCCTGCT ACTTTACGGA AHCACmT 
TAGCTTTrTG GTGACATCn GAGGAimC TGAAGAAAAT 6GCATGGTAT GGTAGGACAA 
GGTGTCATGT CATCTGCAAA CAGT6GCA6T mCCTTCTT CCCmCAAC CTGGATTTCT 
nGATTTCn TCTGTCT6AG TACGACTAGG AHCCCAATA CTATACC6AA TAAAAGTGGC 
AAGAGTGGAC ATCCHGTCT TATTTTTCT6 ACCTTAGAGG AAATGCTTTC AGTTTTTCAC 
CAHAATTAT AATGTTTACT GTGGGCTTGT CATAT6TGGC CnCAHATA TGGAGGTCTA 
TTCCCTCTAT ACCCACCTTG TTGAGAGTTT TTATCATAAA AGTATGTTGA ATnTGTCAA 
AAGTTTTTCC T6CATCTATT GAGATGATTT TTACTCnCA ATTCAHAAT GATTTTTAn 
CnCATTTTG TTAATGATTT CCATTCTTCA ATTTGnAAC 6TGGTATATC ACAHGATTG 
ATTTGTGGAT ACCTTTGTAT CCCT6GGATA AACCTCACn GATCATGAGC TTTCAATGTA 
TTTTTGAATT CACTTTGCTA ATATTCTGTT GGGTATTTTT 6CATCTCTAT TCATCAATGA 
TAHGGCCTA AGAAAGGTTT TGTaGGTTT TAGTATCAGG GTGATGCTGG CCTCATAGAG 
AGAGTTTAGA AGCATTTCCT CCTCTTTGAT TTTTCGGAAT AGTTTGAGTA GGATAGGTAT 
TAACTCnCT TTAAATGTrT GGGGACTTCC CTGGTGAGCC GGTGGHGAG AATCCGCCTC 
AGGGATGTGG GrTTGATCCC TGGTCAGGGA ACCAHAATA AGATCCCACA TGCTGCAGGC 
AACAAGCCCC CAAGCTGCAA CCACTGAGCT GCAACCGCTG CAGTGCCCAC AGGCCACGAC 
CAGAGAAAGC CCACATACAG CAGGGAAGAC CCAGCACAAC CGGAAAAAGG AGTTTGGTGG 
AATACAGCTG T6AAGCCGTC TGGTCCTGGA CTCCTGtrTTG AGGGAATnT HAAAAATTA 
HGAnCAAT nCAHACTG GTAACTGGTC TGnCATAH nCTATTTCT TCCGGGHCA 
GTCHGGGAG AHGTACATG CCTAGGAATG TGTCCGTnC nCTAGGTTG TCCATTTTAT 



T6GACAT6CA T6GGAGCACA CA6CACCGAC 
A66CTG6GGC CCCAAGCAGC ATGGCATCCT 
AGCCCCACAA TnCATTCTG AGAAGTGATT 
TGACCTGCn aGAGGAGCA GGGGnTTGG 
GGTCCAG6TC CCCTCCCAG6 CCCCCCT6TC 
TCTCCCTCCT ACAGTGGTCA GTCCCA6CTG 
CTCTCTCTGG ATGGTATTCT CTGGAAGCTG 
CTGAAGGGCA TGGTTTGTGG TCAC6GTTCA 
GTCCCGAGAT TGGTGGCACC CAGATTTCCT 
TCCCTGGCTG ACCTCCCTCC TCCCTGCATC 
GTCACA6CCT CTCGCATCTA ACGCCGGT6T 
TACCTATGGG GAAGGGCTTC TCACTGCAGT 
TCCCAGTCCG GACGTCAAAC AGGCCGAGCT 
CCCCCCCGCT 6CTGCCTCCA GCTCCT66TG 
CTCAGTCTAG TCTCT6CCTC CGIGHCACA 
CGTTTTCTCT CACAAGGACA CCGGACATTA 
AGCTCACATC TGTAAAGACC TAGATTCCAA 
TGTGAGTTCT GGGGCGACAT CCHCAACCC 
GGAACCTGGG GTTTATC6TA AAACCCAGGT 
HGCAAGAAT TAAAGGT6CT AATACA6ATC 
TCTTTCCATC ACAGGAAAGG GGGGCCTGGG 
CTCTTTCCTG CTACAGTCAC CAACAGTCTC 
AGCCG6AGCT AGTTTAGGAG ACCCCTGAAC 
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CAGCGA6ACT CATGCTGGCT TCCTGGGGCC 1800 

AGAGTGTGTG AAAGCCCACT GACCCTGCCC 1860 

CCTTGCTTCT GCACTTACAG GCCCAGGATC 1920 

CAGGACGGGG AGATGCTGAG AGCCGAC6GG 1980 

T66G6CAGCC CHCGGAAAG AHGCCCCAG 2040 

CCCCAGGCCA GAGCTGCTTT ATTTCCGTCT 2100 

AAGGTTCCTG AAGHATGAA TAGCITTGCC 2160 

CAGGAACTTG GGAGACCCTG CAGCTCAGAC 2220 

AAGCTCGCTG GGGAACAGGG CGCnGITTC 2280 

ACCCAGTTCT GAAAGCAGAG CGGTGCTG6G 2340 

CCAAACCACC CGTGCTGGTG HCGGGGGGC 2400 

GGTGCCCCCC GTCCCCTCTG AGATCAGAAG 2460 

CCCTCCAGAG GCTCCAGGGA GGGATCCHG 2520 

CCGCACCCn GAGCCTGATC TTGTAGAC6C 2580 

CGCCTTCTCC CCAT6TCCCC TCCGTGTCCC 2640 

GATTAGCCCC TGHCCAGCC TCACCTGAAC 2700 

ACAAGATTCC AACCTGAAGT TCCCGGTGGA 2760 

CATCACAGCT TGCAGHCAT CGCAAAACAT 2820 

TCTTCATGAA ACACTGAGCT TCGAGGCHG 2880 

AGGGCAAGGA CTGAAGCTGG CTAAGCCTCC 2940 

G6CGGCT6GA GGTCTGCTCC CGTGAGTGAG 3000 

TCTGGGAAGG AAACCAGAGG CCAGAGAGCA 3060 

CTCCACCCAA GATGCTGACC AGCCAGCG66 3120 
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CCCCCT6GAA ACACCCTACA eTTCAGGGGG GAAGAG6GGC TGACCCGCCA GGTCCCTGCT 3180 

ATCAGGAGAC ATCCCCGCTA TCAGGAGAH CCCCCACCn GCTCCCGHC CCCTATCCCA 3240 

ATACGCCCAC CCCACCCCTG TGATGAGCAG TnAGTCACT TAGAATGTCA ACT6AAGGCT 3300 

TTTGCATCCC CTTTGCCAGA GGCACAAGGC ACCCACAGCC TGCTGG6TAC CGACGCCCAT 3360 

GTGGAHCAG CCAGGAGGCC TGTCCTGCAC CCTCCCTGCT CGGGCCCCCT CTGTGCTCAG 3420 

CAACACACCC AGCACCA6CA HCCCGCTGC TCCT6AGGTC TGCAGGCAGC TCGCT6TAGC 3480 

CT6AGCG6TG T66A6GGAAG TGTCCTGGGA GATTTAAAAT GT6AGAGGCG 6GAGGTGGGA 3540 

GGnGGGCCC T6T6GGCCTG CCCATCCCAC GTGCCT6CAT TAGCCCCAGT 6CTGCTCAGC 3600 

C6TGCCCCC6 CCGCAGG6GT CAGGTCACTT TCCCGTCCTG GGGHAnAT GACTCHGTC 3660 

AHGCCATTG CCATTTTTGC TACCCTAACT 66GCAGCAGG TGCHGCAGA 6CCCTCGATA 3720 

CCGACCAGGT CCTCCCTCGG AGCTCGACCT GAACCCCATG TCACCCTTGC CCCAGCCTGC 3780 

AGA66GTGGG TGACT6CAGA GATCCCTTCA CCCAAGGCCA CGGTCACATG GTTTGGAGGA 3840 

GCTGGTGCCC AAG6CAGAGG CCACCCTCCA GGACACACCT GTCCCCAGTG CTGGCTCTGA 3900 

CCTGTCCnG TCTAAGAGGC TGACCCCGGA AGIGHCCTG GCACTGGCAG CCA6CCTGGA 3960 

CCCAGAGTCC AGACACCCAC CTGTGCCCCC GCTTCTGGGG TCTACCAGGA ACCGTCTAG6 4020 

CCCAGAGG6G ACTTCCTGCT TGGCCTT6GA T6GAAGAAGG CCTCCTAHG TCCTC6TAGA 4080 

GGAA6CCACC CC6GGGCCTG A6GATGA6CC AAGTGGGATT CCGGGAACCG CGTGGCTGGG 4140 

GGCCCAGCCC GGGCTGGCTG GCCTGCATGC CTCCTGTATA AGGCCCCAAG CCTGCTGTCT 4200 

CAGCCCTCCA CTCCCTGCAG AGCTCAGAAG CACGACCCCA GG6ATATCCC TGCAGCCATG 4260 

AAGTGCCTCC TGCTTGCCCT G6GCCTGGCC CTCGCCTGTG GCGTCCAGGC CATCATCGTC 4320 

ACCCAGACCA TGAAAGGCCT GGACATCCAG AAGGHCGAG GGHGGCCGG GTGGGTGAGT 4380 

TGCAG6GCGG GCAGGG6AGC TGG6CCTCA6 AGAGCCAAGA 6A6GCTGTGA CGnGGGHC 4440 

CCATCAGTCA GCTAGGGCCA CCTGACAAAT CCCCGCTGGG GCAGCHCAA CCAGGCGHC 4500 

ACTGTCnGC AHCTGGAGG CTGGAAGCCC AAGATCCAGG TGTTGGCAGG GCTGGCnCT 4560 
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CCTGCGGCCG CTCTCTGGGG AGCAGACGGC CGTCnCTCC AGTCCTCTGC GCGCCCTGAT 4620 

nCCTCnCC TGTGAGGCCA CCAGGCCTGC TGGAAACACG CCTGCCTGCG CAGCnCACA 4680 

CGACCniGT CATCTCTTTA AAGGCCATGT CTCCAGAGTC ATGTGHGAA GHCTGGGGG 4740 

HAGTGGGAC ACAGHCAGC CCCTAAAAGA GTCTCTCTGC CCCTCAAAH mcCCACCT 4800 

CCAGCCATGT CTCCCCAASA TCCAAATGH 6CTACATGTG 6GG66GCTCA TCTGGGTCCC 4860 

TCTTTGGGn CAGTGTGAGT CTGG6GA6A6 CAHCCCCAG GGTGCAGAGT T6GGGGGAGT 4920 

ATCTCAGGGC TGCCCAGGCC 6GGGTG6GAC AGAGAGCCCA CTGTGGGGCT 6GG6GCCCCT 4980 

TCCCACCCCC AGAGTGCAAC TCAAGGTCCC TCTCCAGGTG GCGGGGACH GGCACTCCTT 5040 

GGCTATGGCG GCCAGCGACA TCTCCCTGCT GGATGCCCAG AGTGCCCCCC TGAGAGTGTA 5100 

CGTGGAGGA6 CTGAAGCCCA CCCCCGAGGG CAACCTGGAG ATCCTGCTGC AGAAAT6GTG 5160 

GGCGTCTCTC CCCAACATGG AACCCCCACT CCCCAGGGCT GTGGACCCCC CGGGGGGT6G 5220 

GGTGCAGGAG GGACCAGGGC CCCAGGGCTG GGGAAGAGGG CTCAGAGTTT ACTGGTACCC 5280 

GGCGCTCCAC CCAAGGCTGC CCACCCAGGG CTTTTTTTTT TTTTAAACn nAnAATTT 5340 

GATGC1TCA6 AACATCATCA AACAAATGAA CATAAAACAT TCATTTrTGT HACnGGAA 5400 

GGGGAGATAA AATCCTCTGA AGTGGAAATG CATAGCAAAG ATACATACAA TGAGGCAGGT 5460 

AnCTGAAH CCCTGHAGT CTGAGGATTA CAAGTGTATT TGAGCAACAG AGAGACATTT 5520 

TCATCATTTC TAGTCTGAAC ACCTCA6TAT CTAAAATGAA CAA6AAGTCC TGGAAACGAA 5580 

GCAGTGT6GG 6ATA6GCCCG T6TGAA6GCT 6CTGGGAGGC AGCAGACCTG GGTCHCGGG 5640 

CTCAAGCAGT TCCCGCTACC AGCCCT6TCC ACCTCAGACG GG6GTCAGGG TGCAGGAGAG 5700 

AGCTGGATGG GT6T6GGGGC AGAGAT66GG ACCTGAACCC CAGGGCTGCC TTTrGGGGGT 5760 

GCCTGTGGTC AAGGCTCTCC CTGACCTTTT CTCTCTGGCT TCATCTGACT TCTCCTGGCC 5820 

CATCCACCCG GTCCCCTGTG GCCTGAGGTG ACAGTGAGTG CGCCGAGGCT AGTTGGCCAG 5880 

CTGGCTCCTA TGCCCATGCC ACCCCCCTCC AGCCCTCCT6 GGCCAGCHC TGCCCCTGGC 5940 
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CCTCAGHCA TCCTGAT6AA AAT6GTCCAT 
GGGAGAACGG CGAGTGTGCT CAGAAGAAGA 
T6TTCAAGAT CGATGGTGAG TCCGGGTCCC 
GACT6TGGAC AGGHCAGGG 6GCTGGCGTC 
AT6AA6ACAC TGCCTTGACA CCTGCnCAC 
6G6GCGGTG6 CCAT6GGCAG 6TCCC66CT6 
CTCTCTTTGC T6G6G66CGG 6C66TGCTCT 
6TGCCCTCCC AGG6GTAACC GAGAGCCGH 
CCCCAGCCCG CTCCACAGCT CCTTCATCTC 
TTCACnGTT C6TCCTAAAT CCGAGATGAT 
AGG6CTGCCC HCCGCCGGG CAGCCTGGGC 
CTCTGACTGG AGGCCCT6CA CTGACTGACG 
CATCCAGCTG CACTG6GTTT GGGTGCTG6T 
CAGCC6G66C T6CCCACT6G CCTCGGTCAG 
TGCCCCGACA ATGACCCCAT CCTCAGGACG 
GCCCCACCCG AGATCGG6GG AAGCCCTATT 
6CCTCAGACT GA6TGGTGAG TGHCCCAAG 
CCAGAGHGA CAGT6AGGGC TTCCTG66CC 
GGAA6CACCA ITTCAGGGGT 6666GAT6CC 
TGGTGACCCC GGGGGAGCCC C6CTGGTC6T 
TCCCCCCCCG TTGGAACTCA CTTTTCTCCC 
CAAA6TCCTT GTGCT6GACA CC6ACTACAA 
TGCTGAGCCC GAGCAAAGCC TGGCCTGCCA 
CAG66AGACC AGCTGCGTGG TCCTTGCT6C 
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6CCAATGGCT CAGAAAGCAG CTGTCTTTCA 6000 

TTAHGCAGA AAAAACCAA6 ATCCCTGCG6 6060 

TG6GGGACAC CCACCACCCC CGCCCCCGGG 6120 

666CCCTGGG ATGCTAA6G6 ACT66TGGTG 6180 

nCCCTCCCC T6CCACCTGC CCGGGGCCH 6240 

6C6GGCTAAC CCACCA66GT GACACCCGAG 6300 

GGGCCCTCAG GCTGAGCTCA GGAGGTACCT 6360 

GCCCACTCCA GG66CCCAGG TGCCCCACGA 6420 

CT66AGACAA ACTCTGTCC6 CCCTC6CTCA 6480 

AAAGCnCGA GGGGGGGHG GGGTTCCATC 6540 

CACATCTGCC CnCGCCCCC TCAGGACTCA 6600 

CCAGGGTGCC CAGCCCAGGG TCTCTGGCGC 6660 

CCT6CCCCCA AGCTGCCCGG ACACCACA66 6720 

GGTGAGCCCC AGCTGCCCCC GCTCAGGGCT 6780 

CACCCCCCTT CCCTTGCTG6 6CAGTGTCCA 6840 

TCHGACAAC TCCAGTCCCT GGGGGA6GGG 6900 

TCCAGGAGGT GGTGGA6GGT CCTGGCG6AT 6960 

CCATGC6CCT GGCAGTGGCA GCAGGGAAGA 7020 

AGAGGCCaC CCCACCCCGT CnCGCCGGG 7080 

GGAGGGT6CT GGGG6CTGAC TAGCAACCCC 7140 

GTCHGACCG CGTCCAGCCT TGAAT6A6AA 7200 

AAAGTACCTG CTCTTCTGCA TGGAAAACAG 7260 

GT6CCTGGGT GGGTGCCAAC CCTGGCT6CC 7320 

AACAGGGGGT GGGGGGTGGG AGCHGATCC 7380 
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CCA6GAGGAG GAGGGGTGGG GGGTCCCTGA 
GGAGCCAGTC TGCT6TGGGC CTGTGGGTGG 
6AGACG6GTG GGCT6CAGAA CT6T6ACTGG 
T6AATCTAAC AGCCTTTGTr ACC6GGGAGT 
TACAAAGCCA TCmCAACT ATCACATCCT 
CCGTAGCAGT CCCACTGGGC ATTnCAGGG 
GAGTG6AGGC TCCTGGCTGT GTCAGCCGGC 
A6GT6GGGGG CAGGCTTTCC CCCTGTGACC 
GG6AGGGGAA CTAGGCCAAG GGGGAAGGGC 
AGACCACCCT GGGGAGCAGG GACTGACCCC 
GT6GACAACG AG6CCCT6GA GAAATTC6AC 
C6GCTTGCCT TCAACCC6AC CCA6CT6GAG 
66CA6GAGCC ACCCGGCCCC GGGACGACCT 
CTCCCAGGAG GAAGGGGTGG GGTGCAGCAC 
AGGCCTCTCT TCCCGAGGTG TCCAGTCCCA 
ACAGGGCAGT GCCACGTCTA GGTGAGCCCC 
CTGCCCCACG TCCTGGGCAC ACACATGGGG 
CATCAGGCCC TG6GGTCCCC CCTGTGAGAA 
TGCAGAGCTG GCTGGCCGCG TGCCACTCH 
TGACCTCCTC CAGCTCCm CAGCAGAGCT 
6GGAGGCTA6 CGGTCCTTCT CCCGAGGAG6 
6CTGGCAAGG GTCTGGCAGG TGCCCCAGGA 
GCCCGGGAGC CnGGACTCC TCTGGGGACA 



PCr/DS9SA)2648 

GTCCCGCCAG GAGAGAGTGG TCGCATACCG 7440 

CT6GG6AC6G GGGCCAGACA CACAGGCCGG 7500 

TGT6ACCGTC 6CGATGGGGC CGGTGGTCAC 7560 

TTCAAnAn TCCCAAAATA AGAACTCAGG 7620 

GAAAACAAAT 6GCAG6T6AC ATTTTCTGTG 7680 

CCCCTGTGCC AGGGGGGCGC GGGCATCGGC 7740 

CCAGGGGGAG GAAGGGACCC GGACAGCCAG 7800 

TGCAGACCCA CTGCACTGCC CTGGGAGGAA 7860 

AGGTSCTCTG GAGGGCAAGG GCAGACCTGC 7920 

CGTCCCTGCC CCATAGTCAG GACCCCGGAG 7980 

AAAGCCCTCA AGGCCCTGCC CATGCACATC 8040 

6GTGAGCACC CAGGCCCCGC CCHCCCCAG 8100 

CCTCCCA1GG TGACCCCCAG CTCCCCAGGC 8160 

CCCGTG6GGG CCCCCTCCCC ACCCCCTGCC 8220 

TCCTGACCCC CCCATGACTC TCCCTCCCCC 8280 

TGCCGGTGCC TCTGGGGTAA GaGCCTGCC 8340 

TAGGGGGTCT TGGTGGG6CC TG6GACCCCA 8400 

T6GCT6GAAG CTGGGGTCCC TCCTGGCGAC 8460 

6TGGGT6ACC TGT6TCCTGG CCTCACACAC 8520 

AAG6CTAAGT GAGCCAGAAT GGTACCTAAG 8580 

6GCTGTCCTG GAACCACCAG CCATGGAGAG 8640 

ATCACAG6GG GGCCCCATGT CCATTTCAGG 8700 

GACGACGTCA CCACCGCCCC CCCCCCATCA 8760 
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GG6GGACTAG AAGG6ACCAG GACTGCAGTC 
CCCTCCTG6G 6CTCCTGCTC TG66CA6CTT 
CTCTCCCnC TGAGTCTTTG CTGGACGAC6 
GTCTGGCTCA GAGGAT6ACA GCG6GGCT6G 
ACAACTGGGG GCCCACACAC ATCACTGCGG 
ACTCG6CAGA GACATCT6CC AGTTCACnG 
AG6ACAGAAA 6T6GAAAAT6 GCTGTCTCn 
TTGCTCAT6G ATCAATATGC CnTATGATC 
GTACCCAAAC GCACTGATCT GTCTGGCTAA 
AGAGGTCACA GTGAGAACTG TCTGCACACA 
TCAGTCCTGG TGHCAnGG AGGACTGATG 
CTGAT6TGAA GAGCTGACTC AITTGAAAAG 
AGGAGAAG6G GACGACAGAG GATGAGATGG 
GGTTTGGGTG GACTCCAGGA GTTGGTGATG 
6TTTATGGGG TCACAAAGAC TGAGTGACTG 
TACAGCAAAG TGGGGATTTT TTAGATAATA 
TATnTTATG CATACCTGAA TGCTCA6TCA 
ACCGTAGCCT TCCAGGTTTC TTCT6TCCAC 
GGTAGCCAH TCCTCCTCCA GGGGATCCTC 
TAHGGCAGG TGGATTCTTT ACCACTGTGC 
CCCACHAAT TACCAAAGCT GCTCCAAGAA 
CTGCAGAGGG TGGT6GGG6T AGACT6TGAC 
GG6CCACGT6 ACCCACAGTC CT6CAGACAG 
TCTHAAAAA AAACTGAGGT CTATTnGTG 
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ACCCTTCCTG GGACCCAGGC CCCTCCAGGC 8820 

CTCCnCACC AATAAAG6CA TAAACCTGTG 8880 

66CAGGGGGT GGAGAAGTGG TGGGGAGGGA 8940 

GATCCAGGGC GTCTGCATCA CAGTCTTGTG 9000 

CTCTTTGAAA CTTTCAGGAA CCAGGGAGGG 9060 

GAGTGHCAG TCAACACCCA AACTCGACAA 9120 

AGTCTAATAA ATAHGATAT GAAACTCAAG 9180 

CAGCCAGCCA CTACTGTCGT ATCAACTCAT 9240 

TGATGAGAGA HCCCAGTAG AGAGCTGGCA 9300 

CAGCAGAGTC CACCAGTCAT CCTAAGGAGA 9360 

HGAAGCTGA AACTCCAATG CTTTGGCCAC 9420 

ACCCTGATGC TGGGAAAGAT TGAGGGCAGG 9480 

HGGATGGCA TCACCAACAC AATGGACATG 9540 

GACAGGGAGG CCTGGCGTGC TACGGAAGCG 9600 

AACTGAGCTG AACTGAATGG AAATGA6GTA 9660 

AGAATATACA CATAACATAG TGTATACTCA 9720 

CTCAGTCGTA TCT6ACTCTG TGACCTATGG 9780 

AGAAHCTCC AAGGCAAGAA TACTGGAGTG 9840 

CCGACCCAG6 GAHGAACCG GCATCTCCTG 9900 

CACCAGGGAA GCCCGTGnA CTCTCTATGT 9960 

AAAGCCCCTG TGCCCTCTGA GCTTCCCGGC 10020 

CTGGGAACAC CCTCCCGCH CAGGACTCCC 10080 

CCGGGTAGCT CTGCTCHCA AGGCTCATTA 10140 

ACnCGCTGC CGTAACnCT GAACATCCAG 10200 
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TSCGATGGAC AGSACCTCCT CCCCAGGCCT CAGGGGCHC A6GGAGCCAG CCHCACCTA 10260 

T6AGTCACCA GACACTCGGG GGTGGCCCC6 CCHCAGGGT 6CTCACAGTC HCCCATCGT 10320 

CCTGATCAAA GAGCAA6ACC AATGACHCT TA6GAGCAAG CAGACACCCA CAGGACACT6 10380 

AGGTTCACCA GAGCT6AGCT GTCCmTGA ACCTAAAGAC ACACAGCTCT CGAAGGim 10440 

CTCTTTAATC T6GATTTAAG GCCTACTTGC CCCTCAAGAG GGAAGACAGT CCTGCATGTC 10500 

CCCAGGACAG CCACTCG6TG GCATCCGAGG CCAOTAGTA HATCTGACC GCACCCTGGA 10560 

AHAATCGGT CCAAACTGGA CAAAAACCn GGTGGGAA6T HCATCCCAG AGGCCTCAAC 10620 

CATCCTGCTT TGACCACCCT GCATCTTTTT nCTTTTATG TGTATGCATG TATATATATA 10680 

TATATATTTT TTTTTTTTTC ATTTTTrGGC TGTGCTGGCT GnCGTTGCA GTTCGGTGCG 10740 

CAGGCnCTC TCTAGTTTCT CTCTAGTCTT CTCTTATCAC AGAGCAGTCT CTAGACGATC 10800 

GACGCGT 10807 
(2) INFORMATION FOR SEQ ID N0:8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:8: 
AATTCCGATC 6ACGC6TCGA CGATATACTC TA6ACGATCG ACGCGTA 47 
(2) INFORMATION FOR SEQ ID N0:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 
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(v11) IMMEDIATE SOURCE: 

(B) CLONE: BLGAMP3 

(Xl) SEQUENCE DESCRIPTION: SEQ ID N0:9: 
T66ATCCCCT 6CCG6T6CCT CTG6 
(2) INFORMATION FOR SEQ 10 NO: 10: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(v11) IMMEDIATE SOURCE: 

(B) CLONE: BLGAMP4 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
AACGCGTCAT CCTCT6TGAG CCAG 
(2) INFORMATION FOR SEQ ID NO: 11: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(vli) IMMEDIATE SOURCE: 
(B) CLONE: ZC6839 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
ACTACGTAGT 

(2) INFORMATION FOR SEQ ID N0:12: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(v11) IMMEDIATE SOURCE: 
(B) CLONE: ZC6632 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:12: 
C6ACGCGGAT CCTACGTACC T6CA6CCATG TTTTCCATGA 6G 42 
(2) INFORMATION FOR SEQ ID N0:13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 



(vli) IMMEDIATE SOURCE: 
(B) CLONE: ZC6627 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:13: 
AGGGCTTCGG CAAGCTTCAG G 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZC6521 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GCCAAAGACT TACHCCaC TAGA 



1 
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(2) INFORMATION FOR SEQ ID NO: 15: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 



(vlij IMMEDIATE SOURCE: 
(B) CLONE: ZC6520 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:15: 
6CAT6AAC6T CGC6TGGTGG HGTGCTACC 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 



(v11) IMMEDIATE SOURCE: 
(B) CLONE: ZC65I9 



(Xl) SEQUENCE DESCRIPTION: SEQ ID N0:16: 



ACCACGCGAC GHCATGCTC TAAAACCGH 
(2) INFORMATION FOR SEQ ID NO: 17: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZC6518 
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(x1) SEQUENCE DESCRIPTION: SEQ ID N0:17: 
GCTGCGGGAT CCTACGTACT A6GGGGACAG GGAAGG 
(Z) INFORMATION FOR SEQ ID N0:18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(v11) IMMEDIATE SOURCE: 
(B) CLONE: ZC6629 



(x1) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CGACGCGAAT TCTACGTACC TGCAGCCATG AAAAGGATGG TTTCT 
(2) INFORMATION FOR SEQ ID N0:19: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZC6630 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:I9: 
CGACGCGAAT TCTACGTACC TGCAGCCATG AAACATCTAT TATTG 
(2) INFORMATION FOR SEQ ID N0:20: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(v11) IMMEDIATE SOURCE: 
(B) CLONE: ZC6625 

(Xl) SEQUENCE DESCRIPTION: SEQ ID N0:20: 
GTGAGATTrr CAGATCHGT C 
(2) INFORMATION FOR SEQ ID NO: 21: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(v11) IMMEDIATE SOURCE: 
(B) CLONE: ZC6626 



(xl) SEQUENCE DESCRIPTION: SEQ ID N0:21: 
AAGAATTACT GTGGCCTACC A 
(2) INFORMATION FOR SEQ ID N0:22: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) nPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(vli) IMMEDIATE SOURCE: 
(B) CLONE: ZC6624 



(xl) SEQUENCE DESCRIPTION: SEQ ID N0:22: 
GCTGCGGAAT TCTAC6TACT AHGCTGTGG GAA 
(2) INFORMATION FOR SEQ ID NO: 23: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(vil) IMMEDIATE SOURCE: 
(B) CLONE: ZC6514 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:23: 
CGAC6C6GAT CCTACGTACC TGCAGCCAT6 A6TTGGTCCT T6CAC 45 
(2) INFORMATION FOR SEQ ID N0:24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZC6517 



(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:24: 
GTCTCfGGTA GCAACATACT A 
(2) INFORMATION FOR SEQ ID N0:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZC6516 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 
GGGTTTCTAG CCCTACTAGT AG 
(2) INFORMATION FOR SEQ ID N0:26: 



22 
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(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 



(vll) IMMEDIATE SOURCE: 
(B) CLONE: ZC6515 



(x1) SEQUENCE DESCRIPTION: SEQ ID N0:26: 
GGGTTTCTAG CCCTACTAGT AG 
(2) INFORMATION FOR SEQ ID N0:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 



(XI) 



SEQUENCE DESCRIPTION: SEQ ID N0:27: 

AAGCTACGCG TCGATCGTCT AGAGTATATC GTCGACGCGT CGATCGG 
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Claims ' 



PCT/DS95/02648 



1. A method for producing fibrinogen comprising: 

providing a first DNA segment encoding a secretion 
signal operably linked to a fibrinogen Aa chain, a second DNA 
segment encoding a secretion signal operably linked to a 
fibrinogen chain, and a third DNA segment encoding a 

secretion signal operably linked to a fibrinogen 7 chain, 
wherein each of said first, second and third segments is 
operably linked to additional DNA segments required for its 
expression in the mammary glemd of a host female mammal; 

introducing said DNA segments into a fertilized egg 
of a non-human mammalian species; 

inserting said egg into an oviduct or uterus of a 
female of said species to obtain offspring carrying said DNA 
constructs ; 

breeding said offspring to produce female progeny 
that express said first, second and third DNA segments and 
produce milk containing biocompetent fibrinogen encoded by 
said segments; 

collecting milk from said female progeny; 

and recovering the fibrinogen from the milk. 

. 2. A method according to claim 1 wherein said 
species is selected from the group consisting of sheep, pigs, 
goats and cattle. 

3. A laethod according to claim 1 wherein each of 
said first, second and third DNA segments comprises an intron. 

4. A method according to claim 1 wherein the molar 
ratio of said first, second and third DNA segments is within 
the range of 0.5-1:0.5-1:0.5-1. 

5. A method according to claim 1 wherein each of 
said first, second and third DNA segments is operably linked 
to a transcription promoter selected from the group consisting 
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of casein, ^-lactoglobulin, a-lactalbumin and whey acidic 
protein gene promoters* 

6. A method according to claim l wherein said 
first, second and third DNA segments are expressed under the 
control of a ^-lactoglobulin promoter. 

7. A method according to claim 1 wherein said 
introducing step comprises injecting said first, second and 
third DNA segments into a pronucleus of said fertilized egg. 

8. A method according to claim 1 wherein said 
fibrinogen is human fibrinogen. 

9. A method according to claim 1 wherein said 
second DNA segment comprises a sequence of nucleotides as 
shown in SEQ ID NO: 3 from nucleotide 470 to nucleotide 8100. 

10. A method according to claim l wherein said 
second DNA segment comprises a sequence of nucleotides as 
shown in SEQ ID NO: 3 from nucleotide 512 to nucleotide 8100. 

11. A method of producing fibrinogen comprising: 
incorporating a first DNA segment encoding a 

secretion signal operably linked to an Aa chain of fibrinogen 
into a ^-lactoglobulin gene to produce a first gene fusion; 

incorporating a second DNA segment encoding a 
secretion signal operably linked to a chain of fibrinogen 
into a ^-lactoglobulin gene to produce a second gene fusion; 

incorporating a third DNA segment encoding a 
secretion signal operably linked to a 7 chain of fibrinogen 
into a ^-lactoglobulin gene to produce a third gene fusion; 

introducing said first, second and third gene 
fusions into the germ line of a non-human mammal so that said 
DNA segments are esqpressed in a mammary gland of said mammal 
or its female progeny and biocompetent fibrinogen is secreted 
into milk of said mammal or its female progeny ; 
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Obtaining milk from said maxnmal or its female 
progeny; and 

recovering said fibrinogen from said milk. 

12. A method according to claim 11 wherein said 
mammal is a sheep, pig, goat or bovine. 

13. A method according to claim 11 wherein each of 
said first, second and third gene fusions comprises an intron. 

14. A method according to claim 11 wherein the 
molar ratio of said first, second and third gene fusions 
introduced is within the range of 0.5-1:0.5-1:0.5-1. 

15. A method according to claim 11 wherein said 
introducing step comprises injecting said first, second and 
third gene fusions into a pronucleus of a fertilized egg and 
inserting said egg into an oviduct of a pseudopregnant female 
to produce female offspring carrying said gene fusions in the 
germ line. 

16. A method for producing fibrinogen comprising: 
providing a trsmsgenic female non-human mammal 

carrying in its germline heterologous DNA segments encoding 
Aa, Bfi and 7 chains of fibrinogen, wherein said segments are 
expressed in a xaamoDnary gland of said mammal eind fibrinogen 
encoded by said segments is secreted into milk of said mammal; 
collecting milk from said mammal; and 
recovering said fibrinogen from said milk. 

17. A method according to claim. 16 wherein said 
mammal is a sheep, pig, goat or bovine. 

18. A non-human mammalian embryo containing in its 
nucleus heterologous DNA segments encoding Aa, and 7 chains 
of fibrinogen. 
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19 • A 'transgenic non-human female mammal that 

produces recoverable amounts of human fibrinogen in its milk. 



20. A process for producing a transgenic offspring 
of a mammal comprising: 

providing a first DNA segment encoding a fibrinogen 
Aft chain, a second DNA segment encoding a fibrinogen Bp chain, 
and a third DNA segment encoding a fibrinogen 7 chain, wherein 
each of said first, second and third segments is operably 
linked to additional DNA segments recpiired for its expression 
in a mammary gland of a host female mammal and secretion into 
milk of said host female mammal « 

introducing said DNA segments into a fertilized egg 
of a mammal of a non-human species; 

inserting said egg into an oviduct or uterus of a 
female of said non-human species to obtain an offspring 
carrying said first, second and third DNA segments. 

21. A process according to claim 20 wherein said 
offspring is female. 

22. A process according to claim 20 wherein said 
offspring is male. 

23. A non-human mammal produced according to the 
process of claim 20. 

24. A non-human mammal according to claim 23 
wherein said mammal is female. 

25. A female mammal according to claim 24 that 
produces milk containing biocompetent fibrinogen encoded by 
said DNA segments. 



26. A non-human mamaaial according to claim 23 
wherein said mammal is male. 
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27. A non^human maiomal carrying in its germline DNA 
segments encoding heterologous Aoc, and y chains of 
fibrinogen, wherein female progeny of said mammal express said 
DNA segments in a mammary gland to produce biocon^etent 
fibrinogen* 

28. A mammal according to claim 27 wherein said 
mammal is female. 



29. A mammal according to claim 27 wherein said 
mammal is male. 
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